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(57) Abstract 

The present invention relates lo expression veaors 
comprising nucleic acid sequences which encode an affin- 
ity ligand (e.g., an enzyme, epitope) and a modification 
recognition sequence. The vectors funhcr comprise , at 
least one resiriaion site for the insertion of a nucleic add 
sequence capable of encoding a selected polypeptide. On 
expression, the resulting consirua codes for a fusion pro- 
tein comprising an affinity ligand, the seleaed polypep- 
lide and a modification recognition sequence. The fusion 
protein may be isolated by virtue of the affinity ligand 
and then modified. The expression veaors may further 
comprise a nucleotide sequence encoding a cleavablc link- 
er, such as a thrombin or factor Xa cleavage sequence. 
Tlic invention further relates to expression veaors con- 
taining a nucleotide sequence encoding a gene for a se- 
lected polypeptide and capable of direaing the expression 
of the selected polypeptide as a fusion protein. In addi- 
tion, methods of producing a modified fusion protein are 
disclosed. Modified or labeled fusion proteins of the pres- 
ent invention are useful in a variety of therapeutic, diag- 
nostic and research applications. 
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PLASHIDS FOR THE RAPID PREPARATION OF 
MODIFIED PROTEINS 

Description 



Background 

05 Current methods for the production of labeled 

proteins include iodination, biot inylation , or the in 
vivo labeling of protein. In vivo labeling methods 
require large amounts of radioactivity are required, 
and the protein of interest must be separated from 

10 other labeled cellular proteins- The biotinylation ' 
and iodination procedures also require a purification 
step for each individual protein and, in some cases, 
reaction conditions which can cause inactivation of 
the protein. In addition, using these methods, 

15 modification of a protein may occur at a variety of 
sites, leading to distortions in structure or • 
biological activity. 

For example, iodination protocols relying on 
chloramine T or iodogen result in modifications at 

20 tyrosine residues and some histidine residues. 

Over-substitution and. oxidation damage may result. 
Labeling procedures which use the Bolton-Hunter 
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reagent result in the modification of free amino 
groups on lysine residues. In some proteins this 
particular modification may also have a deleterious 
effect on structure or activity. Although the 
lactoperoxidase- method for iodination emoloys gentler 
conditions, this method also leads to modification of 
tyrosine and histidine residues, with the potential 
for structural distortion and loss of activity. 

Similarly, biotinylation protocols are 
frequently performed using a succinimide ester of 
biotin. The biotin is coupled to the protein through 
free ammo groups, typically on lysine residues. 
Again, modification at one or more positions may 
alter structure and/or function of the protein, m 
15 addition, extensive dialysis is needed to "remove 
uncoupled biotin, which may be deleterious to the 
protein. 



Summary of th^ invention 

The present invention relates to expression 
20 vectors comprising nucleic acid sequences which 
encode an affinity ligand (e.g., an enzyme, epitope) 
and a modification recognition sequence. The vectors 
further comprise at least one restriction site for 
the insertion of a nucleic acid sequence capable of 
encoding a selected polypeptide in frame with the 
affinity ligand and modification sequence. On 
expression, the resulting construct codes for a 
fusion protein comprising an affinity ligand, the 
selected polypeptide and a modification recognition 
30 sequence. 



Qi iRcm-n rn= SHEET 



wo 93/03157 PCT/L'S92/06187 



-3- 



The fusion protein may be isolated by virtue of the 
affinity ligand and then inodified. 

The expression vectors may further comprise a 
nucleotide sequence encoding a cleavable linker, such 

05 as a thrombin or factor Xa cleavage sequence. The 
sequence encoding the cleavable linker is located 
between the affinity ligand and the restriction site 
for insertion of the sequences encoding the selected 
polypeptide. In this location, the linker may be 

10 cleaved to release the protein of interest . following 
modification- The invention further relates to 
expression vectors containing a nucleotide sequence 
encoding a gene for a selected polypeptide and 
capable of directing the expression of the selected 

15 polypeptide as a fusion protein. 

The PGEX-2TK expression vector is one embodiment 
of the present invention- This vector encodes a 
protein comprising, from amino to carboxyl terminus, 
glutathione-S-transf erase (GST) as an affinity 
20 ligand, the thrombin cleavage site as a cleavable - 
linker, and a phosphorylation recognition site for 
the cAMP-dependent protein kinase as a modification 
recognition sequence. A multiple cloning site 
comprising three restriction sites is located 

25 

downstream of the sequence encoding the 
phosphorylation site. Thus, a nucleic acid sequence 
encoding a selected polypeptide may be inserted into 
the vector using one or more of these sites. The 
selected polypeptide is expressed as a GTK-fusion 
protein (G, GST; T, thrombin; K, kinase). 
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in addition, n>ethods of producing a n,odified 
fus.on protein are disclosed. Upon expression in a 
suitable host cell, the protein of interest is 
produced as a fusion protein. The fusion protein can 
be captured on a suitable affinity .atrix by virtue 
Of an. affinity ligand, which interacts reversibly 
with the n.atrix. Modification of the fusion protein 
c.n be carried out while the protein is attached to 
the matrix. Subsequently, the n,odified fusion 
protein »ay be isolated for use by releasing the 
fusion protein from the affinity matrix with a 
suitable agent, m the case where a cleavable linker- 
's present, the fusion protein may be cleaved in 

13 2TJ° polypeptide portion, and 

the aff.n.ty ligand portion or any uncleaved product 
can b adsorption on the appropriate 

affinity matrix. Alternatively, the modified 
polypeptide portion can be released from the affinity 
l.gand portion by, cleaving the modified fusion 

:r::L:r" "^^^"^ ^^^^^ --^ 

Modified proteins of the present invention are 
usefu ma variety of applications. Labeled (e g 
25 proteins may be used as molecular 

tZrll' J" antibodies can be labeled for 

purposes. Proteins may be labeled and used as 
reagents to guantitate or identify an interacting 
protein, such as a receptor. ^ 
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Brief Description of the Drawings 

Figure 1 shows a portion of the nucleotide 
sequence around the kinase recognition site of 
expression vector pGEX-2TK. The portion of the 

05 PGEX-2TK sequence introduced by the synthetic duplex 
is indicated by italics. The sequences which encode 
the linker cleavable by thrombin (Leu-Val-Pro-Arg- 
Gly-Ser) , the downstream kinase recognition site 
(Arg-Arg-Ala-Ser-Val) and multiple cloning site with 

^0 BamHI, Smal and EcoRI sites, are shown. The arrow 
indicates the point of thrombin cleavage. 

Figure 2 is an illustration of the structure of 
expression vector pAR (aRI) 59/60 . The general 
structure of the plasmid is shown at top. bla, 

^5 indicates the ;9-lactamase gene which confers 

ampicillin resistance; ori, indicates the origin of 
replication. In the center panel, the general 
structure of FEK-fusion proteins is illustrated. The 
FLAG peptide portion, comprising the FLAG epitope and 

20 

enterokinase cleavable linker, is indicated. "HMK" 
indicates the location of. the phophorylation site. 
"Protein" 'indicates the location of the selected 
polypeptide. The lower panel shows a more detailed 
illustration of the N-terminal region of the vector. 

?5 

The peptide sequence shown (Met-Asp-Tyr-Lys-Asp-Asp- 

-Asp«Asp-Lys-Ala-Arg-Arg-Ala-Ser-Val-Glu-Phe-) in the 

detail is a contiguous sequence. The extent of the 

FLAG peptide, the enterokinase cleavage site, and HMK 

recognition (phosphorylation site) are indicated. 

^0 Figure 3 is a bar graph showing the effect on 

32^ . ... 

P incorporation m vitro of an HMK sequence on 
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different fusion proteins, comprising portions of the 
retinoblastoma susceptibitliy gene product, RB. 
Proteins were phosphorylated in vitro using 
cAMP-dependent protein kinase. The hatched bars 
indicate that the fusion protein (GST-RB37 9-7 92 or 
GST-RB379-792;pn.706) was expressed from pGEX-2TK and 
contained an HMK sequence, while the solid bars 
indicate that the fusion protein (GST-RB379-792 or 
GST-RB379-792;pn706) was expressed from pGEX-2T and 
did not contain an HMK sequence. 

Unless indicated otherwise, the orientation of 
particular amino acid sequences is such that the 
amino end is on the left and the carboxyl end is on 
the right. 

Detaile d Description of the Invention 

The present invention relates to expression 
vectors comprising nucleic acid sequences which 
encode an affinity ligand and a modification 
recognition sequence. The vectors further comprise 
at least one restriction site for the insertion of a 
nucleic a6id sequence capable of encoding a selected 
polypeptide. The expression vectors may further 
comprise a nucleotide sequence encoding a cleavable 
linker sequence. These vectors are referred to as 
parent vectors. 

The present invention further relates to 
expression vectors which are derived from the parent 
vectors described above, by the insertion of a 
nucleic acid sequence capable of encoding a selected 
polypeptide (e.g., a natural or synthetic cDNA or 
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gencmic DNA which encodes the selected polypeptide 
and is capable of being expressed in the appropriate 
host cell) into the parent vector. The parent 
vector can be cleaved at one or more restriction 
05 sites in order to insert sequences encoding a 

selected polypeptide using known techniques. Linkers 
or other sequences can be used to facilitate 
insertion. Insertion at an appropriate restriction 
site or site in the parent vector results in the in 
^0 frame fusion of the sequences of the selected 
polypeptide with the amino acid sequences for an 
affinity ligand, modification recognition sequence 
and, if present, the optional cleavable linker 
encoded by the parent vector.. On expression, the 
^5 resulting fusion gene encoded by the vector codes for 
a fusion protein comprising an affinity ligand, a 
modification recognition sequence, selected 
polypeptide, and optionally, a cleavable linker. The 
location of the restriction site or sites selected 

2.0 

for insertion in the vector will determine the 
location of the selected polypeptide relative to the 
affinity ligand, modification recognition sequence 
and optional cleavable linker in the encoded fusion 
protein. The selected polypeptide element of the 

25 

fusion protein comprises at least one peptide, 
polypeptide or protein of interest. 

The fusion protein comprising an affinity 
ligand, a modification recognition sequence, an 
optional cleavable linker and a selected polypeptide 
forms a contiguous polypeptide chain. The order of 
these components in the fusion gene and protein can 
vary. The location of these components or additional 
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components in the fusion gene and protein is 
deter:.ined by the order in which the nucleic acid 
sequences encoding the corr.ponents are linked in the 
vector or the parent vector. For exan,ple, fusion 
proteans of the following structure can be :nade 
Where A indicates the affinity ligand, m indicates 
the modification recognition sequence, c indicates 
the optional cleavable linker, and X indicates the 
polypeptide of interest, and the N-ter.inal portion 
Of the fusion is on the left: 

A-(C)-M-X 

A-M-(C)-X 

X-(C)-M-A 

X-M-(C)-A 

M-A-(C)-X 

M-(C)-A-X 

other per„ut.tic„. of th. ==„po„.nts ar. possible. 
However, the cleava.le lin.er is preferably located 
between the affinity ligand and selected polypeptide 

duc^i!:' 7 I'tro- 
ducticn of a selected polypeptide in frame is 

s«pufi., , polypeptide 
(X) . This location can also minimize the effect of 
fusion on the biological activity (e.,., binding 
activity, antigenicity, enzymatic activity, of the 
selected polypeptide. 

^-n-cTx7 T "'^ duplicated (e.g., 

1 K • """'^^P'" modification sites 

can be incorporated to increase the intensity of 
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labeling. These sites may be contiguous or 
noncontiguous in the encoded fusion protein. 
Furthermore, additional sequences (S) can be present 
in the fusion protein. For instance, a sequence 

05 encoding a signal peptide or leader peptide may be 

incorporated into the vector, and the signal peptide 
will be produced as part of the fusion protein. In 
the case of a signal peptide, the additional sequence 
is preferably incorporated at the N-terminus (e.g., 

10 S-X-M-C-A) However, for other sequences, an 

internal or C-terminal location in the fusion protein 
may be desired. 

Note that on expression in a suitable host cell, 
the parent vector may also produce the affinity 

15 ligand, modification sequence, and optional cleavable 
linker sequences as a fusion protein. This product 
may contain additional sequences encoded by the 
vector, depending on the location of a promoter or 
termination signals relative to the coding sequences 

20 for these elements. In some cases, restriction sites 
for insertion of sequences encoding a selected 
polypeptide may disrupt the reading frame of . 
components encoded by the parent vector. Insertion 
of sequences encoding a selected polypeptide will 

2^ restore the reading frame. 

The expression vectors of the present invention 
can be designed for use in a variety of host cells, 
including bacterial host cells such as E. coli and 
eukaryotic host cells (e.g., yeast cells and 

30 mammalian cells). Thus, fusion proteins can be 

glycosylated. Like other expression vectors, in the 



SUBSTITUTE SHEET 



wo 93/03157 PCT/US92/06187 



-10- 



05 



10 



15 



20 



25 



30 



expression vectors of the present invention, a 
promoter is provided for expression of the fusion 
protein in a suitable host cell. Suitable promoters 
can be constitutive or inducible. In the vectors, 
the promoter is operably linked to nucleic acid 
sequences encoding the fusion protein and is capable 
of directing the expression of the corresponding 
polypeptide. A variety of suitable promoters for 
procaryotic (e.g., lac promoter, tac promoter) and 
eukaryotic hosts (e.g., yeast alcohol dehydrogenase 
(ADHi) , SV40) are available. 

In addition, the expression vectors typically 
comprise a selectable marker for selection of host 
cells carrying the plasmid and an origin of 
replication, in the case of a replicable expression 
vector. Genes encoding products which confer 
antibiotic resistance are common selectable markers 
and may be used in prokaryotic (e.g., ^-lactamase for 
ampicillin resistance, tetracycline resistance) and 
eukaryotic cells (e.g., G418) . Genes encoding the •• 
gene product of auxotrophic markers (e.g., LEU2 and 
URA3) are commonly used as selectable markers in 
yeast, use of • viral or phage vectors, and vectors 
which are capable of integrating into the genome of 
the host cell, such as retroviral vectors, are also 
contemplated. The present invention also relates to 
cells carrying these types of expression vectors 
(e.g., transformed cells). 

The incorporation of one or more modification 
recognition sequences into the fusion protein allows 
the production of modified fusion proteins. As shown 
m the Examples, incorporation of a detectable label 
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(e.g., radioactive, fluorescent) at the modification 
site provides a convenient way to label the fusion 
protein. Thus, the invention further relates to the 
fusion proteins and modified (e.g., labeled) fusion 

05 proteins produced from expression vectors of the 
present invention. It is also possible to produce 
fusion proteins of the present invention using 
methods of peptide synthesis, or in vitro 
transcription/translation procedures . Modification 

10 of synthetic fusion proteins is also possible. 

The affinity ligand present in the fusion 
proteins of the present invention is a polypeptide 
encoding an affinity ligand. The affinity ligand 
(i.e., an affinity ligand or portion thereof) is a 

i5 member of a specific binding pair and is capable of 
binding to a specific binding partner. Preferably, 
the binding of the affinity ligand to its specific 
binding partner is reversible, allowing recovery of 
fusion proteins comprising the affinity ligand where 

20 desired. Examples of affinity ligands include, but " 
are not necessarily limited to, antibodies or 
portions thereof, antigens (e.g., influenza 
hemagglutinin) or epitopes (e.g., FLAG epitope), 
enzymes (e.g., glutathione-S-transf erase (GST), 
^-galactosidase ( lacZ ) , the trpE product), hormones, 
growth factors or other proteins capable binding to a 
specific binding partner (e.g., maltose binding 
protein, histidine hexamer (a heavy metal binding 
element), and protein A). A specific affinity ligand 

30 is an affinity ligand comprising that protein or 
peptide or a portion thereof- Thus, a 
glutathione-S-transf erase affinity ligand is an 
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affinity ligand comprising glutathione-s-transf erase 
or a portion thereof, which is capable of binding the 
specific binding partner (e.g., a substrate). The 
affinity ligand portion of the fusion protein is 
05 useful for a variety of purposes, such as providing a 
moiety for attachment to a support or to facilitate 
purification or identification. 

For example, the fusion proteins can be 
captured on an appropriate affinity matrix. An 
10 affinity matrix is a solid support to which is 

attached (preferably covalently) a specific binding 
partner. Fusion proteins of the present invention, 
comprising an affinity ligand, can bind to a specific 
binding partner via the affinity ligand part of the 
15 fusion protein. In the case of an affinity ligand 
which is an antigen, an antibody or portion thereof 
can be used as a specific binding partner in an 
affinity matrix. Alternatively, an antigen or hapten 
may be incorporated into an affinity matrix, and used, 
to capture fusion proteins comprising an antibody as 
an affinity ligand. A number of affinity 
ligand/affinity matrix pairs are available. 
Glutathione-s-transf erase fusion proteins may be 
captured on immobilized glutathione as an affinity 
matrix, with glutathione as the specific binding 
partner. For example, glutathione sepharose 
(Pharmacia) or glutathione agarose beads (Sigma 
Chemical Corp.) can be used. Fusion proteins 
comprising a protein A, maltose binding protein, FLAG 
. peptide, or a histidine hexamer affinity ligand can 
be captured on IgG Sepharose 6FF (Pharmacia) , anylose 
resin (New England Biolabs) . anti-FLAG Ml antibody or 
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anti-FLAG M2 antibody affinity resins (International 

2 + 

Biotechnologies, Inc.)/ or a Ni affinity resin (NTA 
resin, Qiagen) , respectively. In the foregoing, IgG, 
amylose, anti-FLAG antibodies, or a heavy metal are, 
respectively, the specific binding partners. 

When desired, a fusion protein or modified (e.g. 
labeled) fusion protein bound to an affinity matrix 
can be released by contacting the affinity matrix 
with bound fusion protein thereto with a suitable 
elution buffer comprising one or more release 
components. The release component or components can 
be molecules which compete with the fusion protein 
for binding to the affinity matrix (e.g., hapten, 
free peptide epitopes, substrate or substrate 
analogs) , or which can disrupt binding of the 
affinity ligand to the specific binding partner. For 
example, an elution buffer (a buffered solution) 
suitable for releasing fusion proteins comprising a 
glutathione-S-transf erase affinity ligand can be 
formulated comprising reduced glutathione as a 
release component. In one embodiment, an elution 
buffer comprising reduced glutathione, Tris and NaCl 
is used. 

Alternatively, the elution buffer may comprise a 
buffered solution which lacks a specific component 
required for binding. For example, a fusion protein 
with an ompA signal sequence followed by a FLAG 
affinity ligand at the amino terminus can be 
expressed in £. coli . Specific removal of the ompA 
signal sequence upon secretion into the periplasmic 
space results in a fusion protein with an 
N-terminally located FLAG affinity ligand, which is 
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capable of binding to the anti-FLAG Ml antibody in 
the presence of calcium. This type of fusion protein 
can be eluted from an anti-FLAG Ml antibody affinity 
matrix using an elution buffer which lacks calcium 
a specific component required for binding. 

The expression vectors of the present invention 
can further comprise a sequence encoding a cleavable 
linker. A nucleotide sequence capable of encoding 
such a cleavage site can be incorporated into an 
expression vector using known techniques (e.g., 
recombinant DNA and/or mutagenesis), a cleavable 
linker is a peptide or polypeptide capable of being 
cleaved by a site specific protease. For example, a 
thrombin cleavage (e.g., Leu-Val-Pro-Arg-Gly-Ser) , 
factor Xa cleavage site (e.g., Ile-Glu-Gly-Arg) or 
enterokinase cleavage site (e.g., Asp-Asp- 
Asp-Asp-Lys) may be used as a cleavable linker. The 
appropriate protease (thrombin, factor Xa and enter- 
okinase, respectively) can be used to cleave a fusion 
protein containing a corresponding cleavable linker. 
Cleavage by a protease (proteolysis) can occur within 
a cleavable linker or at the border of the linker and 
another component of a fusion protein. 

The product of the cleavage reaction of a 
selected fusion protein comprising the majority of 
the affinity ligand is referred to as the affinity 
ligand portion, and the product of the cleavage 
reaction comprising the selected polypeptide is 
referred to as the selected polypeptide portion of 
the fusion protein. These portions may comprise 
other parts of the encoded fusion protein (i.e., the 
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fusion protein encoded by the expression vector) , and 
such are also fusion proteins. For example, the 
selected polypeptide portion can include the 
modification recognition sequence. If so, and if the 
05 fusion protein has been modified (e.g., labeled), 
then the selected polypeptide portion is referred to 
as the labeled selected polypeptide portion. Thus, 
the term fusion protein refers to a protein, 
polypeptide, or glycoprotein comprising at least two 
10 of the components selected from the group consisting 
of an affinity ligand, modification sequence, 
cleavable linker sequence, selected polypeptide or 
additional sequence . 

Preferably, a cleavable linker is selected which 
15 does not cleave elsewhere within the fusion protein 
(e.g., in the selected polypeptide, affinity ligand 
or modification site) . In addition, the sequences 
encoding the cleavable linker are preferably located 
between those sequences encoding the affinity ligand 
20 and selected polypeptide. In this location, the 

encoded fusion protein can be cleaved to separate the 
affinity ligand portion of the fusion protein from 
the selected polypeptide portion where desired. 
Similarly, where recovery of a labeled selected 
2^ polypeptide portion is desired, the sequences 

encoding the cleavable linker will be on located in 
the vector either upstream or downstream of the 
sequences encoding both the modification recognition 
sequence and the selected polypeptide (e.g., as in 
30 A-C-M-X or X-M-C-A fusions). 

Overlap of the sequences of portions of the 
fusion protein may occur provided the respective 
portions retain function. For example, in one 
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embodiment of the present invention, pAR(ARl) 59/60, 
discussed in more detail below, the affinity ligand 
(FLAG epitope) sequence and cleavable linker sequence 
(enterokinase site) overlap by one amino acid. The 
sequence of the FLAG epitope from N- to C-terminus is 
(Asp-Tyr-Lys-Asp) , while the enterokinase cleavage 
site in this embodiment is (Asp-Asp-Asp-Asp-Lys) . 
Within PAR (^Ri) 59/60, these sequences overlap to give 
the FLAG peptide, an octapeptide of the sequence 
Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys, which retains the 
functions of the affinity ligand (e.g., binding to- a 
specific binding partner) and cleavable linker (e.g., 
cleavage by enterokinase) . Similarly, other 
components of the fusion protein can overlap, 
providing their function is preserved. 

A modification recognition sequence is also pro- 
vided. This sequence, incorporated into the fusion 
protein, directs modification of the fusion protein. 
Modification recognition sequences can be 
incorporated into a fusion protein comprising a 
selected polypeptide which either is naturally 
modified or is not naturally modified, a 
modification recognition sequence for phosphorylation 
(i-e., a phosphorylation site) can be introduced into 
a vector and expressed as part of a fusion protein, 
for example. In the presence of a suitable protein 
kinase, the fusion protein comprising the 
phosphorylation (kinase) site will be phosphorylated. 
Other types of modifications directed by the presence 
of a peptide or polypeptide sequence are envisioned 
in the present invention. For example, glycosylation 
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reactions can be directed by a short peptide 
sequence. Similarly, fatty acylation of some 
proteins is directed by a short peptide sequence 
(Cys-Ala-Ala-X) . The identification of additional 

05 recognition sites and the identification and 

purification of the corresponding modification enzyme 
or enzyme complex will provide alternative protocols 
for modification of the fusion proteins. 

Modification of the fusion proteins is carried 

10 out using a modification enzyme or enzyme complex or 
other suitable means. In the case of a 
phosphorylation reaction, a suitable phosphorylation 
method is used. Typically, the phosphorylation of 
proteins is carried out by a protein kinase. Many 

15 such kinases have been described and have utility in 
the present invention (see e.g., Kemp, B.E., et al . , 
J. Biol, Chem.. 252 : 4888-4894 (1977); Edelman, A.M. 

/ Ann. Rev. Biochem . 56; 567-613 (1987); Glass, 
D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol. 

20 20: 363-388 (1980); Hunter, T. and J. A. Cooper, AnnT 
Rev. Biochem . 54: 897-930 (1985); Hunter. T. , Cell 
50: 823-829 (1987)). Known protein kinases catalyze 
the transfer of the 7 -phosphate group of ATP to the 
hydroxyl groups of serine and/or threonine residues 
on specific substrates or alternatively, catalyze the 
phosphorylation of tyrosine residues. Use of protein 
kinases of both specifities is contemplated in the 
present invention. Serine/threonine kinases include 
cyclic AMP (cAMP) dependent and cyclic GMP (cGMP) 

JO dependent protein kinases, and cyclic nucleotide- 
independent protein 
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kinases. A variety of serine/ threonine kinases have 
been identified including glycogen synthase kinase, 
phosphorylase kinase, casein kinase I casein kinase 
II, pyruvate dehydrogenase kinase, protein kinase C 
05 and myosin light chain kinase. 

Amino acid sequences present in natural 
substrates and artificial peptide substrates which 
are sufficient for activity as a protein kinase 
substrate have been identified (see e.g., Edelinan, 
^0 A.M. et al. . Ann. Rev. Biochem . 56 : 567-613 (1987); 
Glass, D.B. and E.G. Krebs, Ann. Rev. Pharmacol. 
Toxicol. 20: 363-388 (1980); Hunter, T. and J. A. 
Cooper, Ann. Rev. Biochm . 54: 897-930 (1985)). These 
amino acid sequences, in addition to those described 
15 below, and related amino acid sequences which can be 
specifically phosphorylated , can be used as 
modification recognition sequences in vectors of the 
present invention. A corresponding protein kinase, 
capable of phosphorylating a polypeptide comprising 
20 the selected kinase recognition sequence, • can be used " 
to phosphorylate fusion proteins of the present 
invention. For example, the cAMP-dependent protein 
kinase (e.g., the catalytic subuhit of the 
cAMP-dependent protein kinase from bovine heart 
muscle) can recognize the consensus amino acid 
sequence Arg-Arg-Xaa-Ser-Xaa , where Xaa is an amino 
acid, in a variety of substrates, resulting in 
phosphorylation of serine. As shown in the Examples, 
expression vectors of the present invention (e.g., 
20 PGEX-2TK derivatives, or pAR(ARI) 59/60 derivatives), 
incorporating a sequence which encodes a version of 
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the cAMP-dependent protein kinase recognition 
sequence (Arg-Arg-Ala-Ser-Val) , can direct the 
production of fusion proteins capable of being 
phosphorylated by cAMP-dependent protein kinase from 
bovine heart muscle. The Arg-Arg-Ala-Ser-Val 
sequence is also referred to as the HMK site (HMK, 
heart muscle kinase) . Other specific amino acid 
sequences, such as Arg-Arg-Ala-Ser-Leu (Li et. al . , 
Proc. Natl. Sci. USA 86 : 558-562 (1989) and the 
peptide Arg-Thr-Lys-Arg-Ser-Gly-Ser-Val , can be 
recognized and phosphorylated by this kinase. The 
phosphorylation sites can function at a terminal or 
internal location within a fusion protein. 

cGMP protein kinases have a substrate 
specificity that is similar, but not identical, to 
that of the cAMP-de'pendent protein kinases. 
Comparative analysis of substrates has been made 
(Edelman, A.M. et al . , Ann. Rev. Biochem . 56: 
567-613, (1987); Glass, D.B. and E.G. Krebs, Ann. 
Rev. Pharmacol, Toxicol. 20 ; 363-388 (1980); Glass, 
D.B. and E.G. Krebs, J. Biol. Chem. 254: 9728-9738 
(1979))- The substrate specificities of the two 
types of casein kinases has also been studied, and 
the activity of peptide substrates has been compared 
(Marin, O. et al . , Eur. J. Biochem. 160 ; 239-244 
(1986); Sommercorn, J. and- E.G. Krebs, J. Biol. Chem . 
262: 3839-3843 (1987); Kuenzel, E.A. et al ., J. Biol. 
Chem. 262 : 9136-9140 (1987)). For example, casein 
kinase II was shown to phosphorylate the synthetic 
peptide Ser-Glu-Glu-Glu-Glu-Glu . Additional peptide 
substrates were phosphorylated by casein kinase II 
(e.g., in decreasing order of activity. 
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Arg-Arg-Arg-Asp-Asp-Asp-Ser-Asp-Asp-Asp ; Arg-Arg- 
Arg-Glu-Glu-Glu-Ser-Glu-Glu-Glu; Arg-Arg-Arg-Glu-Glu- 
Glu-Thr-Glu-Glu-Glu) . The substrate specificity of 
tyrosine kinases has also been studied (Hunter, T. 
and J. A. Cooper, Ann. Rev. Biochm . 54 : 897-930 
(1985)). Candidate phosphorylation sites 
(recognition sequences) can be incorporated into 
synthetic peptides or into fusion proteins and 
assayed for activity as protein kinase substrates, 
using techniques similar to those described 
previously. Because corresponding phosphatases 
exist, enzymatic removal of phosphate groups is also 
possible. 

The ability to modify the fusion proteins 
provides a convenient method of specifically labeling 
the fusion proteins with detectable radioactive or 
non-radioactive labels to produce a labeled fusion 
protein (e.g., a [ ^^P] -labeled fusion protein). 
Cleavage of fusion proteins can produce a selected 
polypeptide portion comprising a modification 
sequence or an affinity ligand portion comprising a 
modification sequence, depending on the location of 
the cleavable linker in relation to these components. 
Thus, cleavage following labeling can produce a 
labeled selected polypeptide portion or a labeled 
affinity ligand portion, each of which is itself a 
fusion protein. Because the modification recognition 
sequence directs modification to a specific site in 
the protein, there is a high degree of control over 
the location and extent of modification. As shown in 
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the Examples, fusion proteins expressed and modified 
by the introduction of a detectable label retain both 
structure and function. 

The detectable label (e.g., radioactive, 
^5 fluorescent or chemi luminescent) selected will 
determine by the nature of modification, the 
modification enzyme, and the intended use. The label 
will be incorporated into a moiety which is 
transferred to the fusion protein substrate. For 
phosphorylation reactions, [7-labeled] ATP is used as 
phosphate donor (i.e., modification donor). As 
protein kinases transfer the 7-phosphate onto the 
substrate during the phosphorylation reaction, the 
label will be incorporated into the moiety 
transferred by the protein kinase. For example, to 

incorporate a radioactive phosphate label such as 

31 32 33 31 

P, P or P, phosphate donors such as [7 P]ATP, 

32 33 
[7 P]ATP, or [7 P]ATP can be used. Alternatively, 

35 38 

isotopes of sulfur, such as S or S isotopes, can 
be incorporated as the label in the [7-labeled] ATP, 
as for example in "^^S-labeled adenosine 
5' [7-thio] triphosphate. The term "phosphorylation" 
also refers to modification with such thiophosphate 
analogs or other 7-labeled analogs of ATP. Other 

25 

considerations such as specific activity, half -life, 
type of particle emitted and energy of radiation will 
influence selection of an appropriate radioactive 
label. In addition, a fluorescent moiety or 
chemiluminescent moiety incorporated into the 
30 7-phosphate. Detection methods for such labels are 
well known in the art. 
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In other types of modifications (e.g., glyco- 
sylation, fatty acylation) , a radioactive isotope, 
chemiluminescent or fluorescent dye, or other 
nonradioactive label, could be incorporated into the 
modification donor so that it is transferred to the 
fusion protein during the modification reaction. For 
example, a biotin adduct of a modification donor 
could be linked to a fusion protein- In the present 
method, the site of addition of the biotin label is 
controlled by the location- of the modification 
recognition sequence. 

Specific Embodiments 

In one embodiment of the present invention, 
glutathione-S-transf erase (GST) from the parasitic 
helminth Schistosoma japonica (Mr=26,000) is selected 
as the affinity ligand. However, GST genes from 
other sources, such as other bacteria or mammalian 
organisms, can be used. In addition, a portion of a 
GST gene encoding a portion of GST capable of binding 
a specific binding partner, such as the substrate 
glutathione, can be used. In particular, expression 
vectors capable of expressing S. japonicum GST-fusion 
proteins (i.e., a fusion protein comprising a GST 
affinity ligand) were constructed. The parenf vector 
is named pGEX-2TK, and vectors derived from pGEX-2TK 
by the insertion of a nucleotide sequence encoding a 
selected polypeptide, are referred to with the prefix 
pGTK-. The construction of pGEX-2TK and pGTK- 
plasmids is described' in Examples l and 2 and in 
Figure 1. 
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PGEX-2TK is a derivative of pGE>:-2T, the latter which 
is described by D.B. Smith in EP 0,293,249, published 
November 30, 1988, PCT/AU88/00164 , published December 
1, 1988, and New Zealand Patent No. 224,663, issued 
05 November 27, 1990, and by D.B. Smith and K.S. Johnson 
in Gene 67: 31-40 (1988) . The teachings of EP 
0,293,249, PCT/AU88/00164, New Zealand Patent No. 
224,663, and Smith and K.S. Johnson ( Gene 67 : 31-40 
(1988)) are herein incorporated by reference. 

pGEX-2TK encodes a fusion protein having GST as 
an affinity ligand at the amino terminus, followed by 
a thrombin cleavage site (cleavable linker) . In 
addition, a modification recognition sequence for a 
protein )cinase was incorporated downstream of the 
cleavable linker. In particular, a sequence of the 
structure Arg-Arg-Xaa-Ser-Xaa , where Xaa is an amino 
acid, was selected (Arg-Arg-Ala-Ser-Val) . This 
sequence is a phosphorylation site recognized by 
cAMP-dependent protein kinases such as the catalytic, 
domain of the cAMP-dependent protein kinase from 
bovine heart muscle. In pGEX-2TK, the sequence which 
encodes the thrombin cleavage site is followed by 
several restriction sites for the insertion of 
sequences encoding a selected polypeptide. pGEX-2TK 
retains the inducible tac promoter for expression, 
three in frame stop codons, a selectable marker 
(ampicillin resistance) , origin of replication, and 
laclq gene present in pGEX-2T. 

When a sequence which encodes a selected 
polypeptide is inserted in frame into the pGEX-2TK 
expression vector downstream of the phosphorylation 
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site, a GTK-fusion protein comprising the selected 
polypeptide can be produced. The GTK-fusion protein 
(where, from N- to C-terminus, G is a GST affinity 
l^gand, T is a thrombin cleavage site, and K is a 
phosphorylation site for a protein kinase) is encoded 
by a pGTK-vector, can be produced upon expression in 
a suitable E. coli host. These fusion proteins can 
be captured on immobilized glutathione, and labeled 
by phosphorylation with cAMP-dependent protein 
^0 kinase, using [7-^^P]ATP as a modification donor. As 
shown in the Examples, several different cDNAs were 
expressed from pGEX-2TK, and labeled as described. 
The studies described in Examples 1-4 indicate that a 
selected polypeptide present in a GTK-fusion protein, 
encoded by a pGTK-type vector, such as pGTK-RB 
(379-792), can retain both structure and function. 
Furthermore, the labeling (phosphorylation) procedure 
was efficient and did not alter the structure or 
function of the selected polypeptides. 

In another embodiment of the present invention, 
the FLAG epitope is selected as the affinity ligand.' 
Vector pAR(fiRi)59/60, shown in Figure 2, is an 
example of this type of expression vector. Vector 
PAR(ARI) 59/60 is a derivative of bacterial expression 
plasmid pET3a (pAR3040) described by Studier et al. 
Meth. Enzymo]. l85;60-89 (1990), the teachings of " 
which are herein incorporated by reference. The 
construction of pAR(/^Ri) 59/60 is described in detail 
in Example 5. In this construct, the FLAG peptide is 
located adjacent to the N-terminal initiator 
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methionine (Figure 2). The FLAG octapeptide 
(Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lvs) comprises the FLAG 
epitope (Asp-Tyr-Lys-Asp) affinity ligand and an 
overlapping enterokinase cleavable linker 
05 (Asp-Asp-Asp-Asp-Lys) . Cleavage with enterokinase 
typically occurs precisely after the terminal lysine 
(underlined above) of the FLAG octapeptide. The FLAG 
octapeptide is followed by a kinase recognition site. 
In particular, in pAR (z^RI) 59/60, the site is 
10 Arg-Arg-Ala-Ser-Val (HMK) , recognized by cAMP- 

dependent protein kinase. A unique EcoRI restriction 
site located downsrream of the sequence encoding the 
phosphorylation site can be used for the insertion of 
a sequence encoding a selected polypeptide. In 
15 plasmids derived from pAR (aRI) 59/60 , referred to with 
the prefix pFEK- F for FLAG, E for enterokinase, and 
K for the phosphorylation recognition site) , in which 
a sequence encoding a selected polypeptide has been 
inserted, the encoded fusion protein, referred to 
20 herein as a FEK-fusion protein, has the following 
general structure for N- to C-terminus: • Met-[FLAG/ 
enterokinase cleavable linker ] -phosphorylation 
site-selected polypeptide. An alanine residue is 
located between the FLAG peptide and HMK site (see 
25 Figure 2). In addition, the EcoRI site encodes a 
Glu-Phe dipeptide. Depending on the strategy for 
insertion of a sequence encoding a selected poly- 
peptide, additional residues (e.g., encoded by a 
linker or PGR primer) may be inserted between the 
selected polypeptide and the modification recognition 
(HMK) site. In many cases, however, only 17 amino 
acids will be fused to the selected polypeptide. 
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pFEK-plasmids can direct the expression of FLAG- 
fusion proteins (i.e., fusion proteins having a FLAG 
epitope affinity ligand) from the T7 polymerase 
promoter in an appropriate host (e.g., a bacterial 
cell capable of constitutive or inducible expression 
of the T7 polymerase; for examples of suitable hosts 
and induction protocols, see Studier et al. , Meth. 
Enzvmol. i£5: 60-89 (1990)). The construction of 
several pFEK-plasmids is described in Example 5. The 
10 encoded fusion proteins were expressed in a bacterial 
host, and a lysate was prepared. Partially purified 
fractions containing each fusion protein were 
subjected to phosphorylation with the catalytic 
subunit of the cAMP-dependent protein ' kinase using 
^5 li~ P]ATP as a (labeled) modification donor. The 
FEK-fusion proteins were labeled to high specific 
activity to give { ^^P] -labeled FEK-fusion proteins 
(which are also [^^PJ-labeled FLAG-fusion proteins). 

Note that vectors of the present invention 
related to pGTK- and pF.EK- vectors can be constructed' 
lacking the cleavable linker sequence. Thfese vectors 
would have a pGK- or pFK- prefix. A fusion protein 
comprising a selected polypeptide expressed from a 
pGK- or pFK-vector is referred to as pGK-fusion 
protein or pFK-fusion protein, respectively. 

A variety of commercially available vectors 
comprising an affinity ligand and cleavable linker 
are available which can be modified by the 
introduction of a modification site (e.g., a 
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phosphorylation site) , and where required, one or 
more restriction sites for the in-frame insertion of 
a selected polypeptide using recombinant DNA 
techniques. For example, (1) the pGEX--3X vector 

05 (Pharmacia) comprising a GST affinity ligand and 
Factor Xa cleavable linker, (2) the pKAL-C vector 
(Biolabs) comprising a malB affinity ligand and 
Factor Xa cleavable linker, (3, 4) the protein A gene 
fusion expression vectors, pRIT2T and pRITS 

^0 (Pharmacia), comprising a protein A affinity ligand, 
(5, 6) as well as pDS and pQE-vectors (Qiagen) , 
comprising a histidine hexamer affinity ligand, could 
be modified in this way. The vector pRIT2T, vector 
pRITS, pDS and pQE-vectors, could be further modified 
by the insertion of a sequence encoding a cleavable 
linker. Convenient affinity matrices for fusion 
proteins encoded by the foregoing were discussed 
above . 

Methods of Producing a Modified Fusion Protein 

20 

The present invention further relates to methods 
for producing a modified fusion protein. In 
particular, a rapid and convenient method for 
labeling a fusion protein using a radiolabel or 
non-radioactive label donated by [7-labeled] ATP is 

25 

provided. In addition, the methods used are mild; a 
feature which is important for preservation of the 
structure and biological activity (e.g., binding, 
antigenicity, activity) of the components of the 
fusion protein, and of the selected polypeptide in- 
30 particular. 
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For example, fusion proteins of the present 
invention comprising an affinity ligand portion and a 
selected polypeptide portion are expressed in a host 
cell carrying a vector of the present invention. The 

05 host cells are propagated under conditions which 
permit expression of the vector. For example, 
expression .from the particular pGTK- and pFEK- 
vectors described in the Examples requires induction 
by IPTG. The host cells are lysed using known 

^0 techniques to obtain a lysate containing the fusion 
protein. 

In one embodiment, the lysate can be crudely 
fractionated as in Example 5 and directly labeled in 
the presence of [7-labeled)ATP (e.g., [7 -^^P]ATP) , 
and a cAMP-dependent protein kinase, such as the 
catalytic subunit of the cAMP-dependent protein 
kinase from bovine heart muscle. A suitable 
formulation for a kinase reaction buffer and reaction 
stop buffer is given below. 

In another embodiment, the fusion protein is 
modified (e.g., phosphorylated) while bound to an 
affinity matrix. The fusion protein present in the 
lysate is captured on an affinity matrix. This step 
is carried out by contacting the lysate with an 
appropriate affinity matrix (i.e., an affinity matrix 
comprising a specific binding partner of the affinity 
ligand present in the fusion protein) , under 
conditions which permit binding of the affinity 
ligand portion of the fusion protein to the affinity 
30 matrix. Suitable conditions for a variety of 

affinity ligands and matrices are known in the art. 
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For example, GST-fusion proteins (e.g., a 
GTK-fusion) can be captured on iminobilized 
glutathione as an affinity matrix. As shown in 
Example 2, the GST-fusion protein can be captured on 

05 the affinity matrix in the presence of a wash buffer, 
(which was also used as the lysis buffer) . The 
components of the wash buffer permit binding of the 
GST portion of the fusion proteins to the matrix via 
the specific binding partner (glutathione) . In one 
embodiment, the wash buffer comprises a buffer such 
as Tris, salt (e.g., NaCl) , a chelator (e.g., EDTA) , 
and a non-ionic detergent such as nonidet P-4 0. 

As discussed above, FEK-fusion proteins can be 
captured on an anti-FLAG antibody affinity matrix, 

^5 When the FLAG peptide is internal (e.g., not 

immediately at the N-terminus, as in pAR(ARI) 59/60) , 
anti--FLAG M2 antibody can be used as the specific 
binding partner. Suitable conditions and 
formulations (e.g., for wash buffer and elution 
buffer) for anti-FLAG M2 affinity chromatography can 
be obtained from International Biotechnologies, Inc. 

Once bound, the fusion proteins can then be 
washed in order to remove unbound material (e.g., 
contaminating proteins other than the fusion 
protein) . The wash buffers described above are 
suitable for this purpose. 

The affinity matrix with bound fusion protein 
attached is then equilibrated (contacted) with a 
reaction buffer suitable for the modification 

30 reaction. In a phosphorylation reaction, for 
example, a kinase buffer such as HMK buffer is 
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selected. The particular kinase reaction buffer will 
be adapted to the kinase selected. Suitable reaction 
buffers for a variety of characterized kinases are 
available, kkk reaction buffer, suitable for use 
with the cAMP-dependent protein kinase from bovine 
heart n,uscle (HMK) , cor^prises (i) a buffering agent 
such as Tris, (2) a reducing agent such as 
dithiothreitol (DTT) , (3) a salt such as NaCl, and 
(4) a source of magnesium ions or other appropriate 
ion, such as MgCl^ . Although the DTT can be added 
separately, it is present in the kinase reaction 
buffer. In addition, a modification enzyme and 
niodification donor are added. In one particular 
phosphorylation reaction, a protein kinase prepara- 
tion comprising the HMK enzyme is added, and [7- 
labeled]ATP. Typically, the protein kinase will be 
diluted in a suitable solution prior to addition, m 
the examples, a variety of fusion proteins were 
efficiently radiolabeled by phosphorylation using HMK 
reaction buffer, the catalytic subunit of the HMK and~ 
It- P]ATP as a modification donor. 

The conditions appropriate for phosphorylation 
of the bound fusion protein to produce a bound 
labeled (e.g., radiolabeled) fusion protein will vary 
with the protein kinase selected. Typically, the 
steps subsequent to culturing the host cell are 
carried out at 4 'C. However, phosphorylation 
reactions with the catalytic subunit of the 
CAMP-dependent protein kinase from bovine heart 
muscle, have previously been carried out at 37 'c. 
m the present method, phosphorylation can be carried 
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out at 37 However, in another embodiment of "the 

present invention, efficient phosphorylation is 
carried out at 4 ""C. This temperature can preserve 
the activity of fusion proteins, and of the selected 

05 polypeptide portion in particular. 

Optionally, the reaction can be quenched by 
addition of a stop buffer comprising a coinponent 
capable of inhibiting the modification reaction. The 
component capable of inhibiting the reaction will 

10 vary with the modification enzyme, but can include 
inhibitors (e.g., reaction products, competitors), 
chelators, or other agents which stop the reaction. 
It can be desirable to stop a reaction when, for 
example, in subsequent steps, undesired modifications 

15 can occur. However, wash steps can serve to remove 
modification enzymes. 

For example, in a phosphorylation reaction with 
cAMP-dependent protein kinase, an HMK stop buffer can 
be added, comprising a buffer such as sodium phos- 

20 phate, a reaction inhibitor such as sodium pyrophos- 
phate, and a chelator such as EDTA. HMK stop buffer 
can optionally contain a carrier such as bovine serum 
albumin or glycerol. As a reaction product of a 
phosphorylation reaction, sodium pyrophosphate 

25 inhibits the kinase reaction. In addition, chelating 
2+ . 

Mg ions inhibits the reaction. 

An advantage of the present method is that 
unincorporated label is easily removed by washing the 
affinity matrix with bound labeled fusion protein. 
^0 The wash buffers described above, which permit 

binding of the affinity ligand to the affinity matrix 
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via the specific binding partner can be used in this 
step, other methods for labeling proteins with the 
HMK enzyme have relied upon extensive dialysis to 
remove unincorporated label (e.g., Zhao, X.-X. et 
°^ ' Analvt. Biochem. 178: 342-347 (1989)). The 

removal of unincorporated label by washing is 
advantageous because extensive dialysis can 
compromise the function of some proteins and results 
in the production of large quantities of contaminated 
dialysis buffer. 

If desired, the modified fusion protein may be 
isolated for use by releasing the fusion protein from 
the affinity matrix by washing with a suitable 
elution buffer. Elution buffers were discussed 
above. For example, a labeled GST-fusion protein 
such as a GTK-fusion protein can be eluted from 
immobilized glutathione with an elution buffer 
comprising glutathione (e.g., reduced glutathione). 
In one embodiment, an elution buffer comprising 
reduced glutathione, a buffer such as Tris, and a 
salt such as NaCl is used to release labeled 
GST-fusion .proteins from the affinity matrix. 

In the case where a cleavable linker is present, 
after elution, the fusion protein can be cleaved in 
vitro to free the modified (e.g., labeled) 
polypeptide portion. Cleavage is accomplished by 
contacting the fusion protein with an appropriate 
specific protease under conditions which permit the 
cleavage reaction. Such conditions for cleavage by 
enterokinase, thrombin and factor Xa, for example, 
are known. The affinity ligand portion or any 



wo 93/03157 



PCT/US92/06187 



-33- 

uncleaved product can be removed by adsorption on the 
appropriate affinity matrix in the presence of wash 
buffer for example. The modified or labeled selected 
polypeptide portion can be recovered in the 
05 supernatant. 

Optionally, prior to elution from the affinity 
matrix, the modified polypeptide portion can be 
released from the affinity ligand portion by cleaving 
the modified fusion protein at the cleavable linker. 

10 An example protocol for cleavage by a specific 

protease on a column is provided in Abath, F- and A. 
Simpson, Biotechnicues 10: 'l78 (1991), the teachings 
of which are herein incorporated by reference. 

Smith (EP 0,293,249; PCT/AU88/00164 ; NZ 224,663) 

15 and Smith and Johnson ( Gene 67: 31-40 1988)) also 
describe conditions for capture of GST-fusion 
proteins on immobilized glutathione, washes, elution 
and cleavage protocols. 

In one embodiment of the present invention, a 

20 kit for preparing a labeled fusion protein is 

provided, comprising (1) an affinity matrix to which 
a portion of the fusion protein can bind, (2) a wash 
buffer which permits binding of the fusion protein to 
the affinity matrix, (3) a modification reaction 

25 buffer such as a protein kinase reaction buffer, and 
(4) a modification enzyme preparation such as a 
protein kinase preparation. Optionally, the kit can 
contain a reaction stop buffer or an elution buffer 
comprising a release component. The nature of these 

30 elements has been explained above in more detail. 

In another embodiment, a kit can comprise a 
vector of the present invention (e.g., pGEX-2TK or 
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PARI.RIJ 59/60). The vector can b, present in e 
suitable host cell or free Jrom the host cell 
rurther^ore, the Hit comprises each ol the elements 
(1) through Of the kit described above, and if 
desired, a reaction stop buffer. m other 
embodiments, a modification donor such as ,-labeled 
ATP can also be included in the kit. 

preparati • Protein Mnase 

' in tL " " • protein kinase, 

in th T""" '="P-"i'=''. the kinase may be 

can^: d"" T ' - -^«-n. The kLase 

can be dissolved or diluted as necessary prior to 

cont.r° - -"able 

e!bod " cartridge,. I„ one 

embodiment, th. affinity „atri. is provided in a 

cartridge unit, the combination of which acts as a 

ffers, label, can be inserted into the cartridge 

the la"::- """"^ ^-ilitate- 

labelmg procedure, m one embodiment, 

.™obili.ed glutathione can be incorporated into a 

cartridge unit for capture of OST-fu=io„ proteins. 

Pses of th. Present- t ■■- 

expreIs*!on"/f 

te\?f™pi::i:/::rb?cr^".^" - 

uses in research, diagnostic and therapeutic 
applications. 
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For example, radiolabeled proteins are useful in 
imaging. In one embodiment, radiolabeled fusion 
protein or radiolabeled selected polypeptide portion 
can be used to locate cells carrying a ligand 
05 recognized by the selected polypeptide (e.g., a 

receptor, cell surface component, or antigen). For 
example, a growth factor can be used as the selected 
polypeptide, and a labeled fusion protein of the 
present invention comprising the growth factor can be 
10 used to detect cells carrying a receptor for the 
growth factor. 

Similarly, incorporation of a suitable 
radiolabel permits the use of fusion proteins of the 
present invention in targeted radiotherapy. in other 
15 words, the selected polypeptide (e.g., a "hormone, 
growth factor, antibody) present in the fusion 
protein can seek out specific cells and damage or 
destroy them due to the attached radiolabel. An 
isotope with the desired energy and half-life can be 
20 selected for this purpose. Other labels incorporated 
by methods of the present invention and capable of 
killing cells can also be used. 

For example, a labeled antibody or portion 
thereof (e.g., F^, Fab, etc.) produced by the method 
25 of the present invention could be used in this 
manner. Such fusion proteins would be useful for 
many other applications as well. The term antibody 
as used herein refers to antibodies such as chimeric 
antibodies, single chain antibodies, bifunctional 
antibodies and other antibody variants, including 
individual chains (e.g., a heavy chain). A variety 
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of methods for expressing immunoglobulins are 
available (see e.g., Skerra, A. and A. Pluckthun, 
Science 240: 1038-1041 (1988)). 

In one embc-diinent , antibody chains are expressed 
05 as fusion proteins of the present invention in 

coli- A signal sequence (e.g., ompA, phoA) is fused 
to the N-terminus of an antibody chain, which is 
followed by at least one modification recognition 
sequence, an optional cleavable linker, and an 
10 affinity ligand. On expression in e. coli . the 

signal sequence is cleaved from the fusion protein, 
freeing the amino terminus of the antibody chain, and 
preserving the binding function of the variable 
region. Fusion genes encoding both chains can be 
15 expressed in this manner from two vectors or a single 
vector encoding two fusion genes. Alternatively, 
either the heavy or light chain could be expressed as 
a fusion protein of the present invention and the 
complementary chain could be expressed in the same 
20 cell using typical antibody expression vectors. 

In addition, as shown herein, labeled fusion 
proteins of the present invention can be used as 
probes. As shown in Example 4, labeled fusion 
proteins such as ^^p.^^K-fusion proteins, can be used 
as probes for screening expression libraries for 
proteins capable of binding the selected polypeptide. 
For example, a fusion protein of the present 
invention comprising a selected polypeptide is 
expressed, purified by virtue of the affinity ligand. 
and specifically labeled at the modification 
recognition sequence. The labeled, homogeneous 



25 



30 



rjTT euccrr 



wo 93/03157 PCT/US92/06187 



-37- 

fusion protein is then incubated with filters on 
which proteins expressed by individual plaques of a 
target library have been immobilized. Following this 
hybridization step, the filters are washed, and 

05 processed for identification (i.e., by detection of 
the label) of plaques which produce recombinant 
proteins capable of specifically interacting with the 
labeled fusion protein probe- 
In particular, a cDNA encoding a portion (the E7 

^0 binding domain) of the retinoblastoma susceptibility 
gene product (pRB) was prepared as a GTK-fusion 
protein and used to screen a library for proteins 
which interact with pRB. A number of positive clones 
were isolated. One of these clones, designated 
RBAPl, displays properties expected of a cellular 
protein capable of interacting with pRB. Labeled 
fusion proteins of the present invention comprising a 
selected polypeptide can also be used as probes in 
Western blot formats (Example 4). 

20 In addition, as shown in Example 4, labeled 

fusion proteins of the present invention can be used 
to identify proteins present in complex mixtures 
(e.g., whole cell extracts), which are capable of 
interacting with the selected polypeptide. The 
fusion proteins can be used in protocols to isolate 
the interacting proteins. For example, the 
interacting proteins can be captured by the selected 
polypeptide portion of a labeled fusion protein which 
is bound to an affinity matrix. Subsequent cleavage 

30 at a cleavable linker could release the complex of 
the interacting protein and a labeled selected 
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polypeptide portion for further analysis. 
Alternatively, a purified labeled fusion protein 
probe can also be used to determine the amount of or 
to detect an interacting protein or other substance 
in a complex mixture using immunoassay techniques, 
for example. 

In one embodiment, the binding of a specifically 
interacting protein (e.g., an antigen, hormone) or 
other substance to an immobilized (e.g., bound to an 
affinity matrix) fusion protein of the present 
invention can induce a change in the attributes of 
the bound fusion protein, other substances include, 
but are not limited to agents such as a drug, toxin, 
substrate or other ligands capable of interacting 
with the selected polypeptide portion of the fusion 
protein. The difference between the unbound versus 
the bound state could be taken advantage of, as for 
example, in an assay format. For example, the 
presence of the interacting protein or other 
substance in a complex mixture (e.g., blood) could be" 
monitored. For example, binding of an interacting 
protein could induce a conformational change that 
shields the labeling (e.g. phosphorylation) site. 
The complex mixture containing an interacting protein 
is contacted with the fusion protein, which is bound 
to the affinity matrix. The combination is washed, 
equilibrated with a suitable reaction buffer and 
subjected to the labeling procedure. The extent of 
labeling is compared to a control which lacks the 
interacting polypeptide and the presence of the 
interacting protein in the complex mixture is 
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indicated by inhibition of labeling. By comparison 
with the degree of inhibition achieved by a standard, 
such a procedure can also provide information 
regarding the quantity of interacting protein which 

05 is present. 

The vectors and methods of the present invention 
can facilitate the characterization of cDNA clones. 
For example, an uncharacterized cDNA can be inserted 
into a vector of the present invention allowing rapid 

10 purification of the protein, without specific 

knowledge of its properties. The protein can be 
labeled with the maintenance of biological activity, 
and the labeled fusion protein can be used to detect 
potential interactions with other cellular proteins. 

13 The present invention will now be illustrated by 

the following Examples, which are not intended to be 
limiting in any way. 

Introduction to Examples 1-4 

Recently it has been demonstrated that the 

20 adenovirus ElA protein, SV4 0 and polyomaviral T 
antigens, and the human papilloma virus E7 protein 
can bind to the retinoblastoma susceptibility gene 
product, pRB. Neither these viruses nor these 
proteins are thought to be closely related to one 

25 another, yet each of these proteins shares a short, 
homologous, colinear, transforming sequence, typified 
by ElA conserved region 2 (CR2), which appears to be 
responsible for high affinity binding to pRB. The 
region of pRB which interacts with this motif has 
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been mapped. To date, all spontaneously occurring, 
loss of function, pRB mutations which do not grossly 
comproirdse pRB stability, map to this region. This 
led to the hypothesis that pRB, in the course of 
regulating cell growth, must interact with one or 
more cellular proteins bearing a sequence 
structurally resembling the viral pRB-binding motif. 
If such a model were correct, one would predict that 
pRB inactivation might be achieved either as a result 
of competing complex formation with one of the above 
viral oncoproteins, or as a result of RB mutations 
affecting the T/ElA/E7-binGing domain. 
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Example 1 

Construction of pG£X-2TK 

Tvo synthetic oligonucleotides were designed, 
which, on annealing, encode a cAWP-dependent protein 
05 kinase recognition site and three restriction sites. 
Codon utilization for the synthetic duplex was based 
on the prokaryotic codon utilization data of 
(Grantham, R. et al . , Nucleic Acids. Res . 9: r43-r74 
(1981)) . 

^0 The oligonucleotides were synthesized using 

standard techniques and were annealed in vitro (Gait, 
M.C.J. , (1984) Oligonucleotide Synthesis:. A 
Practical Approach , (I.R.L. Press; Oxford)). The 
resulting duplex has 5 '-overhangs compatible with 

i5 BamHI- and EcoRI-cut DNA and is shown below: 

ArgArgAlaSerVal 
5 ' -GATCTCGTCGTGCATCTGTTGGATCCCCGGG 

AGCAGCACGTAGACAACCTAGGGGCCCTTAA-5' 

Plasmid pGEX-2T (Pharmacia) was linearized with BamHI 
and EcoRI;' and the vector fragment was ligated to the 
synthetic duplex to make plasmid pGEX-2TK. 
Incorporation of the duplex into pGEX-2T was 
confirmed by restriction and DNA sequence analysis. 
DNA sequencing was performed using a Sequenase 2.0 
kit (United States Biochemical Corp.) with the 
protocol provided by the manufacturer. The. structure 

25 

of pGEX-2TK in the region surrounding the kinase site 
is shown in Figure 1. 
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As can be seen in Figure 1, the incorporation of 
the synthetic duplex DNA into pGEX~2T resulted in the 
insertion of codons for a cAMP-dependent protein 
kinase recognition sequence, Arg-Arg-Ala-Ser-Val , 
05 immediately downstream of the thrombin recognition 
site present in parent plamid pGEX-2T. In addition, 
the synthetic duplex encoded a multiple cloning site 
(MCS) downstream of the kinase recognition sequence, 
such that the insertion of the duplex led to the 
regeneration of the MCS of the parent plasmid, with 
restriction sites for BamHI , Smal, and EcoRI. As in 
the parent plasmid, the MCS is followed by a sequence 
with stop codons in all three reading frames. 

Example 2 

Generation of "^^P-GST Fusion Proteins 

Construction of pGEX-2TK Vectors Encoding RB Proteins 

RB cDNAs encoding residues 379-792 or residues 
379-928 of the retinoblastoma susceptibility gene 
product, both of which span the T/ElA-binding region, 
20 were subcloned into pGEX-2TK- In addition, an RB 
cDNA encoding residues 379-928, having a naturally 
occurring, loss of function, RB point mutation 
(379-928;706F) , which is known to abrogate T/EIA 
binding, was subcloned into pGEX-2TK.. These pGEX-2TK 
recombinants are named pGTK-RB(379-792) , 
pGTK-RB(379-928) and pGTK-RB ( 379-928 ;706F) , ■ 
respectively. The encoded fusion proteins are named 
GTK-RB(379-792) , GTK-RB ( 37 S-928 ) and 
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GTK-RB (379-928 ;706F), respectively. These RB fusion 
proteins are also referred to as GST-RB fusion 
proteins or RB fusion proreins. 

Two of the RB cDNAs had been generated in an 

05 earlier study by Kaelin et al . (Kaelin, W.G. et al . , 
Mol. Cell. Biol . 10(7) : 3761-3769 (1990)), 
incorporated herein by reference, using the 
polyinerase chain reaction. The RB cDNA inserted into 
pGEX-2TK to make pGTK-RB ( 379-928 ) corresponds to RB 

10 deletion RB dl 1-378 (Kaelin, W.G. et al . , Mol. Cell. 
Biol . 10(7) : 3761-3769 (1990)). The RB cDNA inserted 
into pGEX-2TK to make pGTK-RB ( 379-792 ) corresponds to 
RB deletion RB dl 1-378 ; 793-928 . Residue 379 is a 
methionine residue. 

15 Each amplimer used to generate the cDNAs 

contained a BamHI site, such that the resulting PGR 
product, upon digestion with BamHI, could be ligated 
in frame into the unique BamHI site in pGEX-2T 
(Pharmacia). The 3' amplimer contained a TGA stop 

20 codon as well. The PGR fragments encoding these two 
cDNAs were each cleaved with BamHI and cloned into 
pGEX-2T, which had been linearized with BamHI, and 
treated with calf intestinal phosphatase. The. 
resulting constructs were named pGT-RB ( 379-928 ) and 

25 pGT-RB(379-792) . The cDNAs were cleaved from the 
latter constructs using BamHI, and were each 
subcloned into pGEX-2TK, which had been cleaved with 
BamHI. The resulting plasmids are named 
pGTK-RB (379-928) and pGTK-RB { 379-792 ) . Two 

2^ additional amino acids were incorporated into the 
sequence at the BamHI site due to the structure of 
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the PCR priwers used to clone both cDNAs , such that 
the amino acid sequence at the junction of the 
phosphorylation (kinase) site and the first RB 
residue (Met^^^) is: 

(-Arg-Arg-Ala-Ser-Val-Gly-Ser-Ala-Thr-Met -) 

The mutant RB cDNA and referred to herein as 
either RB(379-928;706F) or RB (379-928 ;pm706) 
corresponds to the cDNA designated (379-928;pm 706) 
constructed by Kaelin et al . (Kaelin, W.g. et al .. 
Cell 64: 521-532 (1991)), also incorporated herein by 
reference. The mutant cDNA fragment was generated by 
PCR of reverse-transcribed mRNA from cells containing 
the mutation. The PCR product was cleaved at the 
unique Ncol and BsmI sites in the RB gene^ (which span 
the 706F mutation), and ligated into pGT-RB (379-928) , 
from which the wild type RB cDNA Ncol-Bsml segment 
had been excised. The resulting plasmid, 
pGT-RB(379-928;706F) , encoded the 379-928 ; 706F cDNA. 
The plasmid was cleaved with BamHI to release the 
CDNA, and the cDNA fragment was subcloned into 
PGEX-2TK which had been cleaved with BaroHl to make 
PGTK-RB(379-928;706F) . The amino acid sequence at 
the junction of the phosphorylation (kinase) site and 
the first RB residue (Met^.^^) is also 

(-Arg-Arg-Ala-Ser-Val-Gly-Ser-Ala-Thr-Met -) 

379 ' ' 

The products encoded by all three cDNAs have 
been tested previously for their ability to bind to 
SV4 0 T antigen, the adenovirus ElA gene product, and 
putative cellular RB-binding proteins, when expressed 
as PGEX-2T encoded GST fusion proteins. The fusion 
proteins encoded by pGT-RB (379-928) and 
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pGT-RB(379-792) were able to bind to T antigen, ElA, 
and the putative cellular RB-binding proteins, while 
the mutant protein encoded by pGT-RB(379-928;706F) 
did not (Kaelin, W.G. et al . , Cell e±: 521-532 
0^ (1991)). 

Expression and Purification of GST-Fusion Proteins 

The expression of the pGEX-2TK-encoded protein 
and recombinant pGEX-2TK-encoded GST fusion proteins 
in E, coli (DH5q; Bethesda Research Laboratories), 
and the subsequent recovery of the proteins on 
glutathione sepharose was carried out essentially as 
described by Smith and Johnson (Smith, D.B. and 
Johnson, K.S., Gene 67: 31-40 (1988); Kaelin, W.G. et 
al. , Cell 64: 521-532 (1991)). Fresh overnight 
cultures of E. coli DH5q , transformed with either 
pGEX-2TK or pGEX-2TK recombinants, were diluted 1:10 
in Luria-Bertani (LB) medium containing ampicillin 
(100 Mg/ml) and incubated for a total of 5 hours at 

20 37 *»C, with shaking. After 1.0 hour of growth at 37 * 
*»C, isopropyl-^-D-thiogalactopyranoside (IPTG; 
Bethesda Research Laboratories) was added to a final 
concentration of 0.1 mM. 

For analysis of total bacterial protein content, 

25 aliquots of each bacterial culture were pelleted in a 
microcentrifuge, were boiled in urea-SDS cracking 
buffer (0.01 M sodium phosphate [pH,7.2], 1% 
^-mercaptoethanol , 1% SDS, and 8M urea), and were 
loaded onto an SDS-polyacrylamide gel. Proteins were 

2^ visualized by Coomassie blue staining. 

For phosphorylation of protein and/or protein 
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recovery using glutathione sepharose (Pharmacia), 
bacterial cultures were pelleted at 5000 x g for 5 
win at 4 "C, and resuspended l/io the original 
culture volume of NETN (20 juM Tris [pH 8.0], 100 mM 
NaCl, i TOM £DTA, 0.5% Nonidet P-40). The bacteria 
were then lysed on ice by mild sonication and 
subjected to centrif ugation at 10,000 x g for 5 min 
at 4 -C. The clarified bacterial sonicates, 
containing the pGEX-2TK-encoded protein or relevant 
GST fusion protein, were rocked for 15-3 0 minutes at 
4 "C with glutathione sepharose (20-30 ^l/^al 
bacterial sonicate). The glutathione sepharose beads 
(Glutathione Sepharose 4B, Pharmacia)) had been 
washed three times and resuspended l:i v/v in NETN 
(see above) supplemented with 0.5% non-fat powdered 
milk prior to use. This step and subsequent steps 
were also carried out at 4«C. 

Phosphorylation of Proteins 

The sepharose beads, with bound protein, were 
then washed three times with NETN followed by one 
wash with IX HMK buffer (20 mM Tris pH [7.5], 100 mM 
NaCl, 12 mli Mgci^). The supernatant was aspirated 
with a 23G needle and the sepharose was resuspended 
in 2-3 bead volumes of IX HMK buffer containing 1 
unit/pl of the catalytic subunit of cAMP-dependent 
protein kinase (Sigma Chemical Co., St. Louis, MO), 
1 fiCi/^l h-yATP (6000 Ci/mMole, 10 mCi/ml, New 
England Nuclear) , and l mM DTT. The kinase reaction 
was allowed to proceed for 30 minutes at 4 'c, with 
periodic agitation of the sepharose to maintain a 
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suspension. The reaction was terminated by the 
addition of 1.0 ml of HWK stop buffer (10 mM Na 
Phosphate [pH 8.0], 10 mM Na Pyrophosphate, 10 mM 
EDTA, 1 mg/ml bovine serum albumin) . Following a 
05 brief spin in a microcentrifuge, the supernatant was 
again removed using a 23 G needle and the sepharose 
was washed 5 times with NETN to remove any 
unincorporated label. Incorporation of label can be 
determined while the protein is attached to the bead. 

10 Elution With Reduced Glutathione 

For further studies, the fusion protein can be 
eluted from the beads. After the final wash the 
residual supernatant was aspirated using a 23 G 
needle and the labeled or unlabeled GST fusion 

15 protein was eluted by rocking the sepharose for 10-15 
minutes in 10-50 bead volumes of 20 mM reduced 
glutathione, 100 mM Tris [pH 8.0], 120 mM NaCl . 

32 

Effect of HMK Sequence on P-Incorporation In Vitro 
The ability of "the Arg-Arg-Ala-Ser-Val sequence 

20 to convert the GST fusion proteins into ..substrates 
for the catalytic subunit of cAMP dependent protein 
kinase was tested. GST fusion proteins encoded by 
the pGEX-2TK recombinants, as well as the 
corresponding pGEX-2T constructs, were overexpressed 

2^ in bacteria and recovered on glutathione sepharose as 
described above. As described in more detail above, 
the purified fusion proteins, while still 
non-covalently bound to glutathione sepharose, were 
incubated with '^^P-7-ATP and the purified catalytic 
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subunit of cAMP-dependent protein kinase for 30 min 
at 4«C, after which a kinase stop buffer was added. 
The sepharose, with bound protein, was extensively 
washed to remove unincorporated label, transferred to 
clean eppendorf tubes, and subjected to Cerenkov 
counting. 

The GST-RB fusion proteins were readily 
phosphorylated in vitro , provided that the kinase 
recognition sequence was present. Figure 2 shows the 
effect of inserting an HMK sequence on ^^P 
incorporation (cpro) in vitro data on four GST-RB 
fusion proteins, in particular, GST-RB ( 379-792 ) and 
GST-RB (379-792, •pin706) fusion proteins which were 
encoded by pGEX-2TK and which contained an HMK 
15 sequence (hatched bars) , showed significant 

incorporation of ^^p. contrast, GST-RB(379-792) 

and GST-RB(379-792;pin706) fusion proteins which were 
encoded by pGEX-2T and lacked an HMK sequence (solid 
bars) , were not significantly radiolabeled. 

In another experiment, fusion proteins from lo " 
ml of bacterial culture were recovered on 25 m1 of 
glutathione sepharose beads, and were subjected to 
phosphorylation using the same protocol. A 
GST-RB(379-928) fusion protein expressed from 
PGEX-2TK and having a kinase site was labeled with 

P (3 X 10 cpn), while the GST-RB (379-928) fusion 
protein expressed from pGEX-2T and lacking a kinase 
site was not comparably phosphorylated (i x 10^ cpm) . 
Similarly, a GST-RB (379-928 ) ;pm706 fusion protein 
expressed from. pGEX-2TK and having a kinase site was 
labeled with (3 ^ ^^6 ^^^^ ^ ^^.^^ 

GST-RB(379-928);pn,706 fusion protein expressed from 
PGEX-2T and lacking a kinase site was not comparably 
phosphorylated (i x lo'' cpm). 
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Exainple 3 

Structural Integrity of Fusion Proteins 

Expression of Fusion Proteins in E> coli 

As discussed above, the GST-RB fusion proteins 

05 expressed from pGEX-2T-derived plasmids 

pGT-RB (379-928) and pGT-RB (379-792) were able to bind 
to T antigen, ElA, and the putative cellular 
RB-binding proteins, while the mutant fusion protein 
encoded by pGT-RB(379-928 ;706F) did not (Kaelin, W.G. 

10 €t_al., Cell 64: 521-532 (1991)). This observation 
suggests that the binding function and structure of 
the RB proteins is not grossly disturbed when they 
are expressed as part of a fusion protein with GST. 
It was further determined that the insertion of 

15 the Arg-Arg-Ala-Ser-Val (RRASV) sequence did not 
grossly alter bacterial expression of GST-RB fusion 
proteins expressed from pGEX-2TK. For this 
determination, whole cell lysates of IPTG-induced 
cultures were prepared and total bacterial protein 

20 content was analyzed directly on. SDS-polyacrylamide 
gels as described above. Protein from E. coli 
transformed with pGTK-RB (379-928) , pGTK-RB (379-792) , 
or pGTK-RB(379-928;pm706) were analyzed. The band 
intensity for each fusion protein was comparable to 
that for the corresponding construct lacking the 
phosphorylation site. Each fusion protein had an 
apparent molecular weight consistent with the size of 
the pRB fragment encoded by the RB cDNA insert and 
the 2 6 kD GST polypeptide. 
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Purific ation of Fusion Proteins 

The subsequent purification of GST-RB fusion 
proteins expressed from pGEX-2TK was not altered by 
the insertion of the RRASV sequence. Bacterial 
sonicates of each of the transf Orroants were also 
prepared, and proteins were subjected to glutathione 
sepharose chromatography as described above. The 
sepharose beads, with bound protein, were then washed 
three tiroes with NETN, and the bound proteins were 
eluted by boiling in SDS sample buffer (2% SDS, lo % 
glycerol, 62 mM Tris [pH 6.8]). The supernatant was 
subjected to electrophoresis on a 10% polyacrylaniide 
gel and visualized by Coomassie blue staining. After 
purification, the intensity of the bands for 
GTK-fusion proteins expressed in E. coli transformed 
with PGTK-RB (379-928) , pGTK-RB (379-792) , or 
pGTK-RB (379-928 ;pin706) was comparable to that 
observed with for the corresponding constructs 
lacking the )cinase site (i.e., pGT-RB (379-928) , 
pGT-RB(379-792) , or pGT-RB (379-928 ;pro706) , 
respectively) 

The following materials and methods were used in 
subsequent experiments. 

Cells a nd Culture Conditions 

The retinoblastoma cell line WERI-Rb27 (a gift 
of Dr. Wen-Hwa Lee) Huang, H.-J. et al . . Science 242 : 
1563-1566 (1988)) and 293 cells, a human embryonic 
cell line transformed by a fragment of the Adenovirus 
5 genome (Graham, F.L. et al ., J. Gen. Virol . 36: 
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59-72 (1977)), were grown in Dulbecco's modified 
Eagle's mediuTti (DMEM) with 10% fetal calf serum 
(Gibco) . AXata cells were grown in RPMI or DMEM, 
with 10% fetal calf serum (Gibco) . C57B1/6 primary 
mouse embryo fibroblasts (MEF) lines expressing 
either wild-type (MEFC'Tex) or mutant (MEFO'Kl) forms 
of T antigen were grown in Dulbecco's modified 
Eagle's medium (DMEM) with 10% fetal calf serum 
(Gibco) and G418 (150 ug/ml) (Ewen, M.E, et al , , Cell 
58: 257-267 (1989)). All cells were grown at 37<*C in 
a humidified, 10% CO^^containing atmosphere. 
Radioisotopic labelling of cells and preparation of 
cell lysates was as described previously (Kaelin, ' 
W.G. et al , . Cell 64: 521-532 (1991)), 

Antibodies 

Tissue culture supernatants were the source of 
monoclonal antibodies PAb 419 and M73 (Harlow, E. et 
al., J. Virol . 39: 861-869 (1981); Harlow E. et al . , 
J' Virol. 55: 533-546 (1985)). The use of these 
antibodies for immunoprecipitation and western 
blotting was described previously (Kaelin, W.G. et 
al.. Cell 6±: 521-532 (1991)), except that electro- 
phoretic transfer of proteins to nitrocellulose was 
performed without the addition of methanol to the 
transfer buffer. 

Binding of Fusion Proteins to T Antigen and ElA 

It was also determined that the insertion of the 
RRASV sequence did not demonstrably alter the binding 
behavior of GST-RB fusion proteins, GTK-RB(379-928) , 
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GTK-RB(379-792) , and GTK-RB ( 379-928 ; 706F) , expressed 
from PGEX-2TK. The pRB binding assays for T antigen 
and ElA binding were perf onned essentially as 
described by Kaelin et al . (Kaelin et_al. , Cell 64 : 
521-532 (1991))- 

Fusion proteins GTK-RB (379-928) and 
GTK-RB (379-792) (bound to glutathione-sepharose) 
retained the ability to bind to SV40 T antigen or 
adenovirus ElA in solution. In contrast, no 
significant binding of T antigen or ElA to the mutant 
RB fusion, GTK-RB (379-928 ;706F) , or GST expressed 
from pGEX-2TK was observed under the same conditions. 

Integrity of Phosphorylated Fusion Proteins 

To determine whether the kinase reaction led to 
a significant alteration in the RB T/ElA-binding 
region of the GTK-RB fusion proteins, the following 
experiment was performed • Whole cell lysates 
containing wild-type T antigen (MEF*Tex cells) , the 
RB-binding and transformation defective T antigen 
mutant Kl (MEFi'Kl cells), or ElA (293 cells) were re- 
solved by SDS-polyacrylamide. gel electrophoresis and 
transferred to nitrocellulose filters (See protocols 
in Example 4). GST-RB fusion proteins were kinased 
in vitro and eluted from the glutathione sepharose in 
the presence of reduced glutathione as described 
above. The eluted labeled protein was incubated 
overnight with nitrocellulose strips cut from the 
filters. The filters were then washed and subjected 
to autoradiography. Hybridization conditions and 
washes were as described below in Example 4, 
Hybridization of Filters. 
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Similar to the unphosphorylated version, 
^^P-GTK-RB(379-792) bound to ElA and wild-type T from 
MEF*Tex cells, but not to the T routant Kl from MEF*K1 
cells. The presence of equivalent amounts of T and 
05 Kl in this assay was confirmed by immunoblotting with 
a monoclonal antibody directed against T antigen. 

The binding of ^^P-GTK-RB (379-792 ) protein to 
ElA was inhibited by the presence of a synthetic 
peptide replica of the human papillomavirus E7 

10 RB-binding motif (wild-type E7 residues 16-32). In 
contrast, a point mutant derivative of this peptide 
with a glu to gin change at residue 26, which is 
known to abrogate in vitro RB binding, was inert as 
an inhibitor. In addition, 22p-GTK-RB(379-928) , but 

15 not •^^P-GTK-RB(379-928) ;706F) , exhibited ElA binding 
in this assay. Thus, it appeared that the structural 
integrity of the T/ElA-binding regions in the GST-RB 
chimeras was preserved during the kinase procedure 
and subsequent elution. 

20 The experiments described in this example 

indicate that the expression, behavior during 
purification and structural integrity of different 
polypeptides inserted into pGEX-2TK is not grossly 
affected by insertion of a kinase sequence or by 

25 subsequent phosphorylation. 
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Examole 4 



Screening for Pr oteins Capable of Specific 
Interacti on with the Protein of Interest and 
Isolation of G enes Encoding Such Proteins 



05 



10 



Library Manipulations 

For primary screening, libraries were plated at 
approximately 40,000 pfu/150 mm plate x 30 plates. 
Induction of ^-galactosidase fusion protein 
expression with IPTG impregnated nitrocellulose 
filters was performed as described by Singh etal. 
(Biotechnioues 7; 252-261 (1989)). Additional 
library manipulations including screening with 

P-labelled cDNA probes, purification of specific 
clones, preparation of recombinant phage DNA, and 
15 subcloning of cDNAs into the sequencing vector pBKS 
(Stratagene) were performed using standard 
techniques. 

Preparation of N itrocellulose Filters for 
Hybridization 

Electrophoretic transfer of proteins from 
SDS-polyacrylamide gels to nitrocellulose for western 
blot analysis was carried out in 192 mM glycine, 25 
mM Tris(Base), and 0.01% SDS. Plaque lifts were 
performed as described above. The nitrocellulose 
filters were placed, directly into IX HBB (25 mM 
Hepes-KOH[pH7.7], 25 mM NaCl, 5mM MgCl^ , 1 mM DTT) 
supplemented with 5% non-fat powdered milk and 0.05% 
NP-4 0 without drying and incubated overnight. This 
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and subsequent manipulations were performed at 4*C 
with gentle rocking. The filters were then denatured 
and renatured as described by Vinson et al . ( Genes 
and Dev . 2: 801-806 (1988)), Processing of multiple 
filters was done in batch. 

Following renaturation, the filters were placed 
in fresh IX HBB supplemented with 5% non-fat powdered 
milk and 0,05% NP-40, and incubated for 1 hour. The 
filters were then incubated in IX HBB supplemented 
with 1% non-fat powdered milk and 0.05% NP-40 for at 
least 30 minutes prior to hybridization. 

Hybridization of Filters 

To prepare the hybridization solution, protein 
expression in £. coli (DH5q) transformed with 

15 PGEX-2TK and pGEX-2TK-derived plasmids were induced 
with IPTG as described above in Example 2. The 
bacteria were then pelleted and resuspended in 1/10 
the original culture volume in Hyb 7 5 (2 0 mM Hepes 
[pH 7.7], 75 mM KCl, 0.1 mM EDTA, 2.5 mM MgCl^, 1 mM" 

20 D«pT, 0.05% NP-40) . The bacterial suspension was 

sonicated with a probe type sonicator (Branson) and 
centrifuged at 10,000 x G for 5 min at 4^C. For 
western blots the clarified supernatant was used 
undiluted or diluted 1:2 with Hyb 75. For screening 

25 

plaque lifts the clarified supernatant was diluted 
1:2-1:8 with Hyb 75 supplemented with 1% nonfat 
powdered milk. 

The relevant 32P-labeled GST fusion protein was 
added at 100,000-250,000 cpm/ml. Hybridization was 

30 

carried out at 4®C with gentle rocking overnight, 
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after which the filters were washed three times 
(10-15 min/wash) with Hyb 75 with 1% nonfat powdered 
milk. The filters were then dried, covered with 
Saran wrap and exposed to film at -lO^C with an 
intensifying screen. 

Screening Precleared Lysates for Specifically 
Interacting Proteins 

Previously, a series of GST-RB fusion proteins, 
expressed from pGEX-2T vectors, and non-covalently 
bound to glutathione sepharose, were used to search 
for cellular proteins capable of interacting, 
directly or indirectly, with the pRB T/E1A/E7 -binding 
domain (Kaelin, W.G. et al . . Cell 64:521-532 (1991)). 
At least 7 cellular proteins were identified which 
bound to GST-RB fusion proteins having an intact 
T/EiA/E7-binding domain. These cellular proteins 
failed to bind to GST-RB fusion proteins (with no HMK 
sequence) derived from spontaneously occurring, loss... 
of function RB mutations. In addition, the binding 
of these cellular proteins to GST-RB fusion proteins 
was blocked by a synthetic peptide corresponding to 
the pRB-binding/transf orming sequence found in SV4 0 T 
antigen. In contrast, a point mutant derivative of 
this peptide, corresponding to the sequence found in 
the transformation defective T mutant Kl, did not 
inhibit binding. 

It appeared that one or more of these proteins 
may fulfill the criteria for a meaningful pRB 
cellular ligand. Unfortunately, which of the 
cellular proteins bound directly to the pRB T/E1A/E7- 
binding domain could not be determined from these 
experiments. 
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To demonstrate which of these cellular proteins 
is capable of interacting directly with RB, an 
^^S-labeled WERI-Rb27 retinoblastoma cell lysate was 
prepared. The lysate was precleared by passage over 

05 glutathione sepharose which had been loaded with 

pGEX-2T-encoded GST- to remove proteins which may bind 
to GST rather than the protein of interest. GT- 
RB (379-792) fusion protein was loaded onto the 
glutathione sepharose beads, and the precleared 

10 lysate was incubated with the bound GT-RB (379-792 ) 
fusion protein in the presence (for strip 1) or 
absence (for strips 2-4) of wild-type E7 peptide 
(residues 16-32). Unbound protein was removed by 
washing and the proteins were eluted by boiling in 

15 sample buffer. 

Bound proteins (directly and indirectly bound to 
the RB-boiind beads) were loaded in wide wells, 
resolved by SDS-polyacrylamide gel electrophoresis, 
and transferred to nitrocellulose. The filter was 

20 then cut into adjacent strips and either subjected to 

autoradiography (strips 1 and 2) or probed with 

^^P-GTK-RB(379-792) in the presence (strip 3) or 

absence (strip 4) of the above-mentioned E7 peptide. 

Strips 3 and 4 were then washed, dried, placed under 

3 5 

25 saran wrap (which greatly reduces the S signal) , 
and placed under film. 

At least two bands appeared to be capable of 
interacting directly with the -^^P-GTK-RB ( 379-792 ) 
probe in a peptide inhibitable manner. These bands 
were not detected by "^^P-CTK-RB ( 379-792 ) on strips in 
which the lysate was incubated with GT-RB (379-792) in 
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the presence of wild-type E7 peptide prior to 
electrophoresis, indicating specificity of binding. 
Furthermore, the ability of these two cellular 
proteins to interact with "^^P-GTK-RB ( 379-792 ) in the 
filter binding assay was not inhibited by a point 
mutant (Glu^g to Gln^g) version of the E7 peptide 
defective in RB binding. 

Screening Whole-cell Lvsates for Specifically 
Interacting Proteins 

An unlabeled whole cell lysate prepared from 
WERl-rb27 cells was also probed for cellular proteins 
capable of binding directly to the RB. The unlabeled 
extract was subjected to SDS-polyacrylamide gel 
electrophoresis and transferred to nitrocellulose for 
Western blotting. The filter was cut into strips 
which were probed with selected "'^P-labeled fusion 
proteins. Again, there appeared to be at least 2 
cellular proteins capable of interacting with 

P-GTK-RB fusion proteins in which the T/ElA-bindinq 
region was intact ( P-GTK-RB(379-792) and 
P-GTK-RB (379-928 )pm706-) fusion proteins) ; 

Furthermore, the binding of these cellular 
proteins to wild-type ■^^P-GTK-RB(379-792) fusion 
protein could be selectively inhibited by three 
different RB-binding peptides. Peptide replicas of 
the RB-binding sequences found in T antigen (amino 
acids 102-115), ElA (tyrosine followed by ElA 
residues 115-132). or E7 (amino acids 16-32) each 
inhibited binding of the labeled probe to the two 
cellular proteins. In contrast, point- mutant 
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derivatives of these peptides failed to inhibit the 
interaction of the probe with the filter-bound 
cellular proteins.- The particular mutant peptides 
tested included the T antigen peptide with a Glu^^^ 
to Lys change, the ElA peptide with a Cys^25 
change, and the E7 peptide with a Glu^^ to Gin 
change. Similar results were obtained when this 
assay was performed with a human Burkitt's lymphoma 
cell line (Akata) as well as with normal peripheral 
blood lymphocytes. 

Use of Labeled GST-fusion Proteins as Probes for the 

Isolation of Genes 

In order to isolate cDNAs encoding cellular 

proteins capable of interacting specifically, and 

directly with RB, an Akata Agtll expression library 

was screened with the "^^P-GTK-RB (379-792 ) probe as 

described in the protocols above. Six Agtll clones 

(clones 1, 3, 4, 5, 6 and 9) encoding ^-galactosidase 

fusion proteins which bound to ^^P-GTK-RB (379-792 ) 

with high affinity were plague purified and subjected 

to further analysis. 

The binding of the fusion proteins encoded by 5 

of these clones (clones 1, 4, 5, 6 and 9) to the 
32 

P-GTK-RB( 379-792) probe was markedly reduced or 
undetectable in the presence of the wild-type E7 
peptide, while the mutant E7 peptide had no effect on 
binding. Furthermore, the fusion proteins encoded by 
all of the clones bound readily to 
"^^P-GTK-RB(379-928) , whereas binding to 
^0 ^^P-GTK-RB(379-928;706F) , a protein with a mutation 
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10 



in the T/E7/E1A binding region of rb, was 
undetectable. Thus, binding to these clones 
displayed RB binding behavior similar to cellular 
proteins which interact with RB. 

Cross-hybridization experiments, and subsequent 
sequence analysis, demonstrated that 4 of the clones 
(Clones 1, 3, 4 and 6) contained overlapping cDNA 
fragments derived from a common mRNA. The gene 
encoding this mRNA will be referred to as RBAPl. 

Sequence analysis of clone 5 predicted that it 
encoded a /S-galactosidase leader polypeptide fused to 
the sequence His-Ser-Phe-Leu-Leu-Cys-Asp-Glu-Asn-Val- 
Leu-Asp-stop. Thus, this fusion protein contains the 
Leu-X-Cys-X-Glu motif common to the viral RB-binding 
motifs. Additional cDNA clones related to clone 5 
were obtained by screening a 293 cell library with 
the clone 5 insert cDNA. Sequence analysis of these 
clones suggested that the short open reading frame in 
clone 5 was generated by the juxtaposition of a 
normally untranslated cDNA segment with the Agtll 
^-galactosidase coding sequence. Thus, while the 
clone may not represent a cellular protein, upon 
expression in Agtii, it encodes a protein with 
properties (e.g., a Leu-X-Cys-X-Glu motif) common 
with viral RB-binding motifs. 

Clone 9, unlike clone 5, contained a long open 
reading frame, consistent with the possibility that 
it encodes an RB-interacting protein. 
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Additional Characterization of RBAPl 

The retinoblastoma gene product is believed to 
serve as a cell cycle regulatory eleirient. In 
particular, pRB is thought to contribute to the 
^5 regulation of cell cycle progression as cells 
traverse the Gl/S boundary. Consistent with a 
possible role as an RB-interacting protein, RBAPl 
message levels appear to respond to cell, cycle 
events . 

A Northern blot was prepared of total RNA 
obtained from resting peripheral blood lymphocytes 
(PBL) and from PEL at various time points following 
stimulation with a cocktail containing PMA, PHA, and 
a calcium ionophore. The Northern blot was probed 
with an RBAPl probe from clone 4. A single message 
of about 3.5 kb became detectable 24-36 hours after 
stimulation. 

In a similar experiment, PBL were stimulated to 
enter the cell division cycle in the presence of 

20 hydroxyurea (HU) . Again, a single 3 . 5 kb mRNA was 
detected with the RBAPl probe. The abundance of the 
message in RNA from HU-treated PBLs increased to a 
maximum at 3 6 hours, and then appeared to plateau. 
Furthermore, the level of the mRNA in cells detected 

25 by the RBAPl probe fell dramatically within 8 hours 
after removal of HU. 

In vitro Binding of RBAPl to pRB 

The RBAPl cDNA was cloned into pGEX-2T and 
expressed in E. coli as a GST-fusion protein. 
Glutathione sepharose beads were loaded with the 
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GST-RBAPi fusion. "^^S-labeled RB proteins were 
prepared by in vitro transcription and translation 
from cDNA clones as described (Kaelin, W.G. et_al. , 
Mol. Cell. Biol. 10(7) : 3761-3769 (1990)). The 
labeled RB proteins, RB (37 9-928) and 
RB (379-928 ;pm706) were incubated with the bound 
GST-RBAPl fusion protein. The beads were washed and 
analyzed for the amount of fusion protein retained. 
Quantitative recovery of the in vitro translated 
^0 _^^P-GTK-RB(379-928) protein, but not of the 
P-GTK-RB(379-92 8;pni7.06) fusion protein was 
observed. This experiment further suggests that the 
RBAPi product is capable of interacting directly with 
the RB gene product. 

Expression of pi07 as a GTK-fusion Protein 

The RB-related protein pl07 was also expressed 
as a GTK-fusion protein. A cDNA encoding a region of 
pl07 which is homologous to the T/E1A/E7 binding 
domain of the RB gene product was inserted into 
PGEX-2TK. Using the protocols described in the above 
examples, a GTK-pi07 fusion protein was expressed in 
E. coli, a lysate was prepared, the fusion protein 
present in the lysate was captured on a 
glutathione-sepharose affinity column, and labeled 
with [ P] in situ in a phosphorylation reaction 
using the catalytic subunit of cAMP-dependent protein 
kinase and [t-^^PJATP. The t^^P]-labeled GTK-pi07 
fusion protein was eluted from the column, and was 
used as probe in Western blot analysis of whole cell 
extracts. Whole cell extracts (unlabeled) containing 
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ElA were prepared from 293 cells, subjected to 

SDS-polyacrylamide gel electrophoresis, and electro- 

phoretically transferred to filters. Filters were 

cut into strips and probed with (1) anti-ElA 

05 antibody, (2) [^^P]-labeled GTK-pl07 fusion protein, 

(3) ["^^P] -labeled GTK-pl07 fusion protein in the 

presence of competing wild-type E7 peptide (residues 

16-32), and (4) [ "^^P] -labeled GTK-pl07 fusion protein 

in the presence of mutant E7 peptide (residues 16-32, 

iO with a Glu to Gin mutation at residue 26) . 

32 

Autoradiography indicated that the [ P] -labeled 

GTK-pl07 fusion protein, alone or in the presence of 

mutant E7 peptide, was able to detect the ElA product 

on the blots. The specificity of interaction was 

3 2 

15 indicated by the observation that the [ P] -labeled 
GTK-pl07 fusion protein did not detect the ElA 
product in the presence of competing wild-type E7 
peptide . 

Example 5 

20 Construction of pAR(/^RI) 59/60 and Properties of 

pFEK-fusion Proteins 

Construction of pAR(ARI) 59/60 

Plasmid pAR3040 (also referred to as pET3a) was 
the starting material (Studier, Meth. Enzymol . 185 : 
60-89 (1990)). This plasmid and expression plasmids 
derived from this vector can be maintained and 
induced for expression as described by Studier 
(Studier, Meth. Enzymol .185 : 60-89 (1990)). Plasmid 
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PAR3 04 0 was cut with EcoRI. The overhangs were made 
blunt by "filling in" using Klenow enzyme and dNTPs, 
and religated to destroy the EcoRI site. In an 
infrequent event, the EcoRI site was regenerated on 

'5 propagation of the plasmid. Therefore, another 
version of of this intermediate was constructed by 
treating with mung bean nuclease to reinove the EcoRI 
overhangs prior to religation. Regeneration of the 
EcoRI site in the latter version was not observed. 

0 Both versions of the intermediate construction were 
modified as described below, to make two slightly 
different pAR(ARI) 59-60 vectors (i.e., differing in 
the manner in which the EcoRI site was destroyed) • 
These vectors behave identically with respect to 

^ expression of encoded fusion proteins. The 

intermediate vectors were then cleaved with Ndel and 
treated with calf intestinal alkaline phosphatase 
(CIP). 
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Two complemenrary oligonucleotide adaptors, MAB 
59 and MAB 60, were synthesized, kinased, and 
annealed. The adaptors were then ligated into the 
Ndel cleaved, CIP--treated vectors. The adaptors 
05 introduce the 17 amino acid sequence comprising the 
FLAG peptide and HMK site shown in Figure 2. The 
sequence of the adaptors is shown below: 



Met Asp Tyr Lys^Asp Asp Asp Asp Lys Ala Arg Arg- 

5'-T ATG GAC TAC AAA GAC GAT GAC GAT AAA GCA AGA AGA- 
10 AC CTG ATG TTT CTG CTA CTG CTA TTT CGT TCT TCT- 

Ala Ser Val Gin Phe- 

GCA TCT GTG GAA TTC CA 

CGT AGA CAC CTT AAG GT AT- 5' 

Plasmids having one insert of the double-stranded 
^5 adaptor were identified by restriction analysis of 
miniprep DNA. An Xbal-BamHI double digest releases a 
fragment comprising the inserted sequence. • The 
structure of resulting plasmids, named pAR(ARI) 59/60 , 
was confirmed by dideoxy sequencing using 
oligonucleotide primers MAB 58 
(5'-GCAGCCAACTCAGCTTC-3' ) and MAB 69 
( 5 ' -TTAATACGACTCACTAT-3 ' ) . 

Construction of Derivatives of pAR(ARI) 59/60 

Four derivatives of pAR(ZiRI) 59/60 were prepared. 
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The first derivative encoding a selected polypeptide 
was prepared by the insertion of a 1.5 kb EcoRI 
fragment of a shPan-1 (N3) cDNA into pAR(ARI) 59/60 , 
which had been cleaved with EcoRI and treated with 
°^ CIP. The shPan-1 (N3) cDNA encodes a DNA binding 
protein, as described by German et al . (German et 
al. » Molec. Endocrinol . 5: 292-299 (1991)). 

Plasmids containing the insert were detected by 
restriction analysis of miniprep DNA using a 
BamKI-stuI double digest. The correct orientation 
yielded a diagnostic 500 bp fragment. The sequence 
at the junctions between the vector and insert was 
verified by sequencing, using oligonucleotide priners 
MAB 58 and MAS 69 (see above) . The plasaid is 
referred to herein as pFEK-shPan-l. 

The second derivative of pAR(ARl) 59/60 was made 
by the insertion of N3-SH, a fragment of shPan-l 
generated by PCR (German et al .. Molec. Endocrinol . 
5: 292-299 (1991)). For cloning, the fragment was 
generated by PCR, using oligonucleotide primers MAB 
72 (5'-GGCCGAATTCTCCTGGTCCCACGGAGACCC-3' ) and MAB 73 
( 5 ' -GGCCGAATTCGCCGAGGAGGACAAGAAGGACC-3 ' ) . The PCR 
products were purified on an agarose gel, 
electroeluted, treated with T4 DNA polymerase, and 
digested with EcoRI. The resulting fragment was 
ligated to EcoRl-cut, CIP-treated pAR (aRI) 59/60 . 
Constructs with single inserts were identified by 
Xbal-BamHI double digests of miniprep DNA. The 
junctions and sequence of the insert were verified by 
30 dideoxy sequencing using oligonucleotides MAB 58 and 
MAB 69 (see above). The resulting construct is 
referred to herein as pFEK-N3-SH. 
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The third derivative of pAR (z^RI) 59/60 was 
constructed by the insertion of a DNA sequence 
encoding amino acids 12 0 through 206 of the rat c-fos 
protein (Kouzarides, T. and E. Ziff, Nature 336 : 
^5 646-651 (1988); Curran, T. et al . , Oncogene 2: 79-84 

(1987) ; Nakabeppu, Y, and D. Nathans, EMBO J . 8: 
3833-3841 (1989)). The specific fragment used for 
cloning was generated by the polymerase chain 
reaction (PGR) / using oligonucletide primers MAB 70 
(5'-GGCCGAATTCGCGCAGAGCATCGGCAGAAG-3' ) and MAB 71 
(5' -GGCCGAATTCCTACTAGATCTTGCAGGCAGGTCGGT-3 ' ) . The 
resulting PGR products were purified on an agarose 
gel, electroeluted , treated with T4 DNA polymerase, 
and digested with EcoRI . The resulting fragment was 
ligated into EcoRI-cut, CIP-treated pAR(riRI) 59/60. 
Isolates with single inserts were identified by Xbal 
and BamHI double digests. The junctions and insert 
sequence were verified by dideoxy sequencing using 
the oligonucleotide primers MAB 58 and MAB69 (see 

20 

above) • The resulting plasmid is referred to herein 
as pFEK-c-fos (120-206) . 

The fourth derivative of pAR{ARI) 59/60 was 
derived by the insertion of amino acids encoding 
residues 206 through 340 of the human c-jun- protein 
(Kouzarides, T. and E. Ziff , Nature 336 : 646-651 

(1988) ; Bohmann, D. et al . , Science 238 : 1386-1392 
(1987)), The fragment for cloning was generated by 
PGR using oligonucleotide primers MAB 74 
{5'-GGCGGAATTGTTTCCCGCGGAACCGGAGGA-3 ' ) and MAB 75 

30 (5'-GGCGGAATTCCCGACGGTCTCTCTTCAAA-3 ' ) . The PGR 
products were purified on an agarose gel, 
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electroeluted, treated with T4 DNA polymerase, and 
digested with EcoRI. The resulting fragment was 
ligated to EcoRI-cut, CIP-treated pAR(ARI) 59/60. 

Constructs having a single insert were again 
identified by Xbal-EamHI digestion of mini-prep DNA. 
The junctions and insert sequences were confirmed by 
dideoxy seuqencing using primers MAB 58 and MAB 69 
(see above). The resulting plasmid is referred to 
herein as pFEK-c-jun(206-340) . 

pFEK-f usion Proteins Retain Biological Activity and 
Are Efficiently Phosphorylated 

As each of the selected polypeptides expressed 
from pAR(ARI) 59/60 is a DNA binding protein, the 
activity of the selected polypeptide portions, in the 
context of a fusion protein comprising an affinity 
ligand and modification site, were assayed by 
electrophoretic mobility shift assays (EMSA) . Assay 
conditions for the FEK-shPAN-l fusion protein encoded 
by pFEK-shPan-1 and by pFEK-N3-SH, were as described 
by German et al . (German et al . , Molec." Endocrinol . 
5: 292-299 (1991)). EMSA conditions for the fusion 
protein encoded by pFEK-c-fos ( 120-206) were as 
described (Kouzarides, T. and E. Ziff , Nature 336 : 
646-651 (1988); Curran, T. et al . , Oncogene 2: 79-84 
(1987); Nakabeppu, Y. and D. Nathans, EMBO J . 8: 
3833-3841 (1989)). 

EMSA conditions for the fusion protein encoded 
by pFEK-c-jun(206-340) were as described (Kouzarides, 
T. and E. 2iff, Nature 336: 646-651 (1988); Bohmann, 
D. et al . , Science 238: 1386-1392 (1987)). The DNA 
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binding activity of the FEK-c-jun (206-340) fusion 
protein present in bacterial extracts, was assayed in 
combination with bacterial extracts containing fos 
•'core" (to assay binding of jun as a jun-fos 

05 heterodimer) . 

The four unlabeled fusion proteins were made by 
in vitro transcription/translation. The particular 
DNA probes used in each shift assay were radiolabeled 
for detection of DNA-fusion protein complexes. As a 

10 positive control, each of the native (i.e., not 

fused) DNA binding proteins was prepared by in vitro 
transcription/translation and subjected to the shift 
assay. The short polypeptide produced by 
pAR(ARI) 59/60, and the transcription/translation 

15 products from a vector control for each 

pFEK-construct in which the selected polypeptide 
insert was in a reverse orientation, provided 
negative controls for DNA binding. As indicated by 
the shift assay, all four pFEK-fusion proteins bound 

20 DNA to an extent comparable to the corresponding 
native protein. These results indicate that the 
incorporation of the selected polypeptides (fos, jun, 
shPah-1 (N3), and N3-SH) into fusion proteins of the 
present invention comprising a modification 

^5 recognition site, did not alter the biological 
activity (DNA binding) of these proteins. 

In parallel to the experiment just described, in 
vitro transcribed and translated fusion proteins were 
modified by phosphorylation with HMK kinase and 

^0 non-radioactive ATP (see below for conditions) . 
Modification by phosphorylation could be 
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distinguished because the negative charge of the 
additional phosphate group on the modified fusion 
protein led to an alteration in migration of the 
DNA-fusion protein complex as compared with the 
unmodified fusion protein-DNA complex. Modification 
of all four of the fusion proteins did not alter the 
extent of complex formation, indicating that the 
biological activity of the selected polypeptide 
portion is not altered by modification. 

Inducti on of Protein Expression in BL21 Bacteria 

Induction of expression from pAR(ARI) 59/60 
and pFEK- plasmids was done essentially as described 
by Studier et al. (Studier, Meth. Enzvmol . 185 : 60-89 
(1990)). Briefly, plasmids were transformed into 
competent BL21 or BL21 pLysS bacteria. A single 
colony was inoculated into Luria broth with 
ampicillin at 50 ug/ml and with chloramphenicol at 25 
ug/ml. Growth at BV'C was allowed to proceed until 
an optical density (Ag^^) of approximately 1.0 was " 
attained. IPTG was added as described by Studier 
(Meth. Enzymol. 185: 60-89 (1990)). Cells were grown 
for another 3 hours or so at 37»c. A bacterial 
lysate was prepared using standard techniques. 

Following expression in E. coli , the 
unfractionated bacterial extract from transf ormants 
carrying expression vector pFEK-shPan-i , pFEK-N3-SH, 
pFEK-c-fos ( 120-206) , or pFEK-c-jun (206-340) was 
assayed by electrophoretic mobility shift assay as in 
the previous section. The results indicated that the 
unmodified or modified (by phosphorylation with 
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unlabeled ATP) FEK-fusion proteins expressed in 
bacterial extracts retained the biological activity 
of the selected polypeptide portion. 

Partial Purification of Proteins Produced : 
05 Following expression in E. coli , bacterial 

extracts from cells transformed with a pFEK- vector 
were fractionated to obtain partially purified 
FEK-fusion proteins by one of two methods - 

DEAE-Sephacel chromatography: 

-used as per directions of supplier 

(Pharmacia-PL) 
-buffer contained 100 mM NaCl and 10% glycerol 
(in addition to the buffer constituents 
presents in the bacterial lysates) 
-treated in "batch" for 60 minutes at h-4*C 
-lysate was incubated with resin with 
continuous gentle agitation 
-supernatant solution passed over a 2 ml 

column of resin 
-flow-through collected 

The fusion proteins encoded by vectors 
pFEK-shPan-1 and pFEK-N3-SH were obtained using 
DEAE-Sephacel chromatography at -50% purity. 

B) Hepar in-Sepharose chromatography : 
-5 -used as per directions of supplier 

(Pharmacia-PL) 



A) 

10 
15 
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-DEAE flow-through fraction from protocol (A) 
was loaded onto a 1 ml bed volume column of 
resin 

•resin washed with 10 column volumes of load 
05 buffer (i.e., 100 mW salt) 

-proteins were eluted with 1 column volume each 
of 200, 300, 400, 500, 600, 700, 800, 900 and 
1000 mM salt 
-fractions were tested for the presence of 
10 active protein by the appropriate assay 

For example, in the case of a fusion protein 
comprising the fos 'core' (encoded by 
pFEK-c-f OS (120-206) ) , activity of fractions was 
assayed by electrophoretic mobility shift assay 
15 together with reticulocyte-produced c-jun protein. 
Peak activity for the fusion protein comprising the 
fos 'core' fusion protein was eluted at approximately 
500-600 mM salt yielding a preparation with -50% 
purity. 

^0 Phosphorylation of Proteins with HMK : 

All four fusion proteins produced by the pFEK- 
vectors (pFEK-shPan-1 , pFEK-N3-SH, 

pFEK-c-f OS (120-206) , and pFEK-c-jun (206-340) ) were 
capable of being modified by phosphorylation when 
present in crude bacterial extracts. As discussed 
above, modification by phosphorylation with unlabeled 
ATP was observed. In addition, modification using 
radioactively-labeled ATP was observed. 
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Proteins which were partially purified by the 
methods (A and B) described above were also 
efficiently radiolabeled by phosphorylation vising KMK 
enzyTne and [7-'^P]ATP. Labeling to a soecific 
05 activity of approxir.ately 10^ to lo^ cpm per ,q of 
protein was observed for partially purified 
FEK-fusion proteins encoded by pFEK-shPan-l , 
PFZX-N3-SH, and pFEK-c-fos (120-206) . The protocols 
used for protein labeling are described below. 

P-'^osDhorvla tion with Heart Muscle K,-n^.= ^ . 

Sigma # p-2645 Protein Kinase, Catalytic subunit 
from bovine heart was obtained as lyophilized powder 
m 250 unit vials. Typically, 250 units of hkk 
enzyrie was resuspended in 25 >,1 of 40 aM DTT (i.e. 
13 at 10 u/Ml). The solution was allowed to stand at' 
room temperative for 10 minutes and was stored at +4 
"C. Activity was stable for 2-3 days, but KMK was 
usually freshly reconstituted for each use. 

A lOx preparation of HMK Buffer was prepared 
(200 mM Tris-Cl (pH 7.5), lo mM DTT, 1 M NaCl, and 
120 Mgci^). The phosphorylation reaction mixture 
was as follows: 



20 



3 All (lOx) HMK buffer 

^"^ 32 (e.g., NEN-Dupont #NEG-035C 

tT P]-ATP >7000Ci/mmol) 
1-10 ul of protein extract (amount will vary with 

particular protein) 
1 ul of 10 u/pl HMK . 
water to a total of 3 0 ^^1 
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The reaction mixture was incubated at 37 "c for 30-60 
Minutes, and was stored on ice until use. Some 
proteins were labeled almost as well when the 
incubation was carried out on ice. The low 
temperature incubation can serve to preserve the 
biological activity of the modified fusion protein. 
These procedures can be scaled up as required. 



05 



10 



15 



G50 column chrom atography to Remove Unincorporated 
Label 

FEK-fusion proteins were used for Western 
blotting or for plaque screening. For these 
applications, the fusion protein was run a G50 column 
as described below in order to remove unincorporated 
label following the Hl^K phosphorylation reaction. 



"Z + 0.1 M KCl" buffer (25 mM Hepes-KOH ([pH 
7.7], 12.5 mM MgCl^, 20% glycerol, 100 mM KCl) was 
prepared. For solutions comprising Z + KCl, and BSA, 
the solution was filtered through a 0.2 filter 
prior to use. DTT was added to a final concentration 
20 of 1 mM just prior to use. 

— G50 fine (pre-swollen in water) was washed (ix) 

with 5 ml of (Z + 0.1 M KCl + 3 mg/ml BSA + i mM 
DTT) and resuspended at (i:i] in the same buffer 

-the beads were rotated at RT for -1 hour, and washed 
three times with (Z + 0.1 M KCl + i mg/ml BSA f 
1 mM DTT) 

-a 1 ml sterile plastic pipette was packed with the 
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washed G50 (bed vol -1.2 ml) and equilibrated at 

room temperature with -50-10 ml of (2 -r o.l M 

KCl + 1 mg/ml BSA + i mM DTT) 
-immediately prior to loading the column, the 

total volume of the HMK reaction was brought to 

100 ^1 with ice-cold (Z + 0.1 M KCl + 1 mM DTT) 
-the column was loaded and run at room temperature, 

taking 1 drop fractions (-45 ^l/drop) 
-fractions were stored immediately on ice 
-2-5 pi aliquots were removed from each fraction and 

counted by Cerenkov counting 
-the excluded peak fractions (usually at 1/3 to 1/2 

the column volume) were pooled 

Optionally aliquots of the fractions (e.g., 1.0 pi) 
can be fractionated on SDS-polyacrylamide gels to 
monitor the progress of the fusion protein. The 
above procedure can also be carried out at 4 C where 
desired- . 

Equivalents 

Those skilled in the art. will be able to 
recognize, or be able to ascertain, using no more 
than routine experimentation, many equivalents to the 
specific embodiments of the invention described 
herein. Such equivalents are intended to be 
encompassed by the following claims. 
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CLAIMS 

1. An expression vector comprising a nucleotide 
sequence which encodes at least one affinity 
ligand, at least one modification recognition 
sequence and at least one restriction site for 
the in frame insertion of a nucleotide sequence 
encoding a selected polypeptide. 

2. The expression vector of Claim 1 wherein the 
affinity ligand is a glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, and the modification recognition 
sequence is a phosphorylation site. 

3. The expression vector of Claim 1 further 
comprising a nucleotide sequence which encodes a 
cleavable linker located between the affinity 
ligand and the restriction site for insertion of 
a selected polypeptide. 

4. The expression vector of Claim 1, wherein the 
affinity ligand is a glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, the modification recognition sequence is 
a phosphorylation site, and the cleavable linker 
is selected from the group consisting of a 
thrombin cleavage site. Factor Xa cleavage site, 
and enterokinase cleavage site. 
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5. The expression vector of Claim 4, wherein the 
vector is pGEX-2TK. 

6. The expression vector of Claim 4, wherein the 
vector is pAJ^(^^RI) 59/60. 

7. An expression vector comprising a nucleotide 
sequence which encodes a fusion protein 
comprising an affinity ligand, a modification 
recognition sequence and a selected polypeptide. 

S. The expression vector of Claim 7, wherein the 
affinity ligand is a glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, and the modification recognition 
sequence is a phosphorylation site. 

9. The expression vector of Claim 7 wherein the 
nucleotide sequence further encodes a cleavable 
linker located between the affinity ligand and ' 
the selected polypeptide. 

10. The expression vector of Claim 9, wherein the 
affinity ligand is glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, the modification recognition sequence is 
a phosphorylation site, and the cleavable linker 
is selected from the group consisting of a 
thrombin cleavage site, factor Xa cleavage site 
and enterokinase cleavage site. 
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11. A inethod of producing a labeled fusion protein 
comprising: 

a) expressing a fusion prorein comprising an 
f affinity ligand, a phosphorylation 

05 recognition sequence and a polypeptide in a 

host cell using an expression vector; 

b) lysing the host cells to obtain a lysate 
containing the fusion protein; 

c) contacting the lysate with an affinity 

10 matrix under conditions sufficient to bind 

the affinity ligand portion of the fusion 
protein to the affinity matrix; 

d) washing the product of step (c) to remove 
unbound material; and 

15 e) contacting the bound fusion protein with 

[7-labeled] ATP and a kinase reaction buffer 
comprising a protein kinase under 
conditions appropriate for phosphorylation 
of the bound fusion protein to thereby 

20 produce a labeled fusion protein which is 

bound to the affinity matrix. 



12. The method of Claim 11 further comprising the 
steps of: 

f) optionally stopping the reaction of step 
(e) by addition of a stop buffer; 

g) washing the affinity matrix with 
labeled fusion protein bound thereto to 
remove unincorporated label; and 



25 
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h) eluting the labeled fusion protein 

with a suitable elution buffer comprising a 
release component thereby obtaining a 
labeled fusion protein, 

13. The method of Claim 11 further comprising the 
steps of: 

f) optionally stopping the reaction of step 
(e) by addition of a stop 
buffer; 

10 g) washing zhe bound labeled" fusion protein to 

remove unincorporated label; and 
h) cleaving the labeled fusion protein with a 
site specific protease to produce a labeled 
selected polypeptide portion. 

15' 14. A method of producing a labeled fusion protein 
comprising: 

a) expressing a fusion protein comprising a 
GST affinity ligand, a- phosphorylation 
recognition sequence and a polypeptide in a 

20 host cell using an "expression vector; 

b) lysing the host- cells to obtain a 
lysate containing the fusion protein; 

c) contacting the lysate with immobilized 
glutathione under conditions sufficient to 

25 bind the GST portion of the fusion protein 

to bind to glutathione, thereby obtaining 
immobilized glutathione with the bound 
fusion protein; 
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d) washing the inmofailized glutathione with 
bound fusion protein to remove unbound 
Siaterial; and 

e) contacting the bound fusion protein with 
[7-labeled]ATP and a kinase reaction buffer 
coTTiprising a protein kinase under 
conditions appropriate for phosphorylation 
of the bound fusion protein to thereby 
produce immobilized glutathione with 
labeled fusion protein which is bound to 
immobilized glutathione. 

15. The method of Claim 14 wherein the protein 
kinase is the catalytic subunit of a 
cAMP-dependeht protein kinase. 

16. The method of Claim 14 wherein the [7-labeled3 ATP 
is .[7-'^^P]ATP. 

17. The method of Claim 14 further comprising the 
steps of: 

f) optionally stopping the reaction of step 
■(e) by addition of a stop buffer; 

g) washing the immobilized glutathione with 
bound labeled fusion protein to remove 
unincorporated label; and 

h) eluting the bound labeled fusion 
protein with a solution comprising reduced 
glutathione thereby obtaining a labeled 
fusion protein. 
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18. The method of Claim 14 further comprising the 
steps of: 

f) optionally stopping the reaction of step 
(e) by addition of a stop buffer; 
05 g) washing the immobilized glutathione with 

bound labeled fusion prorein to remove 
unincorporated label; and 
h) cleaving the labeled fusion protein with 
thrombin to produce a labeled selected 
10 polypeptide portion. 

19. A kit for preparing a radiolabeled 
glutathione-S-transf erase (GST) fusion protein 
comprising: 

a) an immobilized glutathione affinity matrix; 
15 b) a wash buffer which permits binding of a 

GST-fusion protein to the affinity matrix; 

c) a kinase reaction buffer; 

d) a protein kinase preparation; 

e) an elution buffer comprising reduced 
20 glutathione for releasing the fusion 

protein from the affinity matrix; and 
optionally 

f) a reaction stop buffer. 



20. A labeled fusion protein comprising an affinity 
ligand, a modification recognition sequence, and 
a polypeptide. 
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21. The labeled fusion protein of Claia 20, wherein 
the affinity ligand is a 

glutathione-S-transferase affinity ligand and 
the modification recognition sequence is a 
phosphorylation recognition sequence. 

22. The labeled fusion protein of Claim 20, wherein 
the affinity ligand is the FLAG epitope, and the 
modification recognition sequence is a 
phosphorylation recognition sequence. 

23. A host cell containing the expression vector of 
Claim .1. 
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FLAG- 
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Protein 



Enter okinase 

FLAG peptide 



Ala 



■^50 Met Asp Tyr Lys Asp Asp Asp Asp Lys 

. . AAGGAG ATATA CATATG GAC TAG AAA GAG GAT GAG GAT AAA GCA 

Ndel 

HMK recognition 



Arg Arg Ala Ser Vol. Glu Phe 
AGA AGA GCA TOT GTG GAA TTC CATATG 

EcoKl Ndel 

V ^ 
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